OBJECTIVES

The objective of this focused interdisciplinary program is to develop a
microsensor-microelectronics-microactuator (M³) system that can execute
real-time distributed control for reducing the surface shear stress in a
turbulent boundary layer.

ACCOMPLISHMENTS 

Major advances have been made in the following areas:

•  Sensors 

*  vacuum  insulated  shear  stress  sensor  array 

*  layered  structure  diaphragm  for  low  heat  conduction 

•  Actuator 

*  high  off-plane  displacement  and  large  force  electromagnetic  actuator 

*  in-plane electrostatic actuator

•  Control  Logic 

*  neural  network  based 

*  physical  based 

*  model  based 

•  System 

Detailed  summaries  of  these  achievements  can  be  found  in  the  attached  reports. 

MEMS-Based Active Drag Reduction in Turbulent

Boundary  Layers 

Thomas  Tsao,  Fukang  Jiang,  Chang  Liu,  Raanan  Miller,  Steve  Tung,  Jin- 
Biao  Huang,  Bhusan  Gupta,  Dave  Babcock,  Changhoon  Lee,  Yu-Chong  Tai, 
Chih-Ming  Ho,  John  Kim,  and  Rod  Goodman 

Abstract 

Drag  reduction  is  a  problem  of  great  interest.  From  a  practical  point  of  view,  any 
object  moving  in  a  fluid  experiences  drag.  When  the  objects  in  question  are  man-made 
vehicles,  and  the  drag  experienced  by  such  vehicles  results  in  increased  fuel  costs  and 
decreased  operating  efficiencies,  the  effort  spent  to  understand  and  reduce  drag  becomes 
an  extremely  worthwhile  endeavor.  Indeed,  for  as  long  as  man  has  spent  moving  in 
vehicles,  he  has  been  interested  in  moving  faster  with  less  effort. 

As  an  example  of  how  important  the  reduction  of  drag  would  be  for  the  airline 
industry,  one  can  look  at  work  done  by  Walsh  [1],  who,  in  1985,  estimated  that  if  the 
viscous  drag  experienced  on  the  fuselage  of  airplanes  could  be  reduced  by  10%,  the  cost 
savings would be on the order of $350 million. By extrapolating to 1998 using a 3%
annual rate of inflation and a more than doubling of air travel [2], those numbers translate
into a savings of over $1.1 billion.
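
The extrapolation can be checked with a few lines of arithmetic. The sketch below compounds the 1985 estimate at 3% per year over the 13 years to 1998 and then applies a traffic growth factor. The function name and the use of an exact factor of 2.0 are illustrative assumptions, since the text only states that air travel more than doubled.

```python
def extrapolated_savings(base_musd, inflation, years, traffic_factor):
    """Scale a cost estimate (in $ million) by compound inflation and traffic growth."""
    return base_musd * (1.0 + inflation) ** years * traffic_factor

# 1985 estimate of $350 million, 3% annual inflation, 1985 -> 1998 (13 years).
doubling_only = extrapolated_savings(350.0, 0.03, 13, 2.0)

# Traffic growth factor that would be needed to reach the quoted $1.1 billion.
required_factor = 1100.0 / extrapolated_savings(350.0, 0.03, 13, 1.0)

print(f"savings with exact doubling: ${doubling_only:.0f} million")
print(f"traffic factor needed for $1.1 billion: {required_factor:.2f}")
```

Under these assumptions an exact doubling of traffic gives a little over $1.0 billion, so the quoted figure corresponds to traffic growth of roughly 2.1 times.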

This  report  is  dedicated  to  explaining  an  ongoing  effort  to  reduce  drag  in  turbulent 
boundary  layers  through  the  use  of  a  distributed  MEMS  (micro-electro-mechanical 
system)  to  enact  active  control.  Emphasis  will  be  placed  on  MEMS  components, 
electronics,  and  control.  Before  details  are  presented,  some  background  must  be  given 
regarding  both  the  nature  of  drag  and  some  examples  of  drag  reduction  in  nature. 


Background 


There  are  many  sources  of  drag  on  any  object  moving  through  a  fluid.  Here,  we 
will  only  consider  two  types:  pressure,  or  form,  drag  due  to  flow  separation  and  viscous, 
or  skin-friction,  drag. 

First  of  all,  there  is  form  drag,  which  is  a  result  of  a  negative  pressure  differential 
between  the  front  and  the  back  of  the  object.  A  fluid  moving  past  an  object  will  separate 
from  the  object  and  cause  a  wake  to  be  produced  behind  the  object.  The  downstream 
velocity  in  this  wake  is  less  than  the  upstream  velocity.  By  applying  the  laws  of 
conservation  of  momentum,  the  velocity  lost  by  the  flow  is  transferred  to  a  retarding  force 
(which  can  also  be  thought  of  as  a  pressure  differential)  on  the  body.  Race  car  drivers 
often  take  advantage  of  these  pressure  differentials  by  “drafting”  or  closely  following  the 
driver  ahead  of  them  and  taking  advantage  of  the  fact  that  the  leading  driver  is  “pushing” 
most  of  the  air  out  of  the  way.  To  reduce  form  drag,  it  is  necessary  to  delay  the  onset  of 
separation  for  as  long  as  possible. 

Secondly,  there  is  viscous  drag,  which  exists  within  a  very  thin  region  adjacent  to 
the  object.  The  existence  of  this  “skin”,  called  the  boundary  layer,  is  a  direct  result  of  a 
boundary condition applied to the flow field known as the no-slip boundary condition.
Essentially, this condition requires that the tangential velocity of any flow next to a solid
boundary be equal to the velocity of the boundary itself. For a non-moving boundary, this
implies  a  tangential  velocity  of  zero.  Within  the  boundary  layer,  then,  the  velocity 
increases in a very short distance from zero to the free-stream velocity. This fast change in
velocity results in a shearing force at the surface that is directly proportional to the gradient
of  the  velocity  with  respect  to  the  distance  away  from  the  surface.  The  shear  stress  and  the 
drag,  which  is  the  shear  stress  integrated  over  an  area,  can  be  expressed  as  follows: 

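\[ \tau = \mu \frac{du}{dy} \]

\[ F_d = \int_A \tau \, dA \]

where τ is the shear stress, μ is the fluid viscosity, u is the flow velocity, y is the
distance away from the surface, and A is the surface area.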
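
The two expressions above translate directly into a short calculation. The following is a minimal sketch, assuming a Newtonian fluid and approximating the area integral by a sum over equal-area cells. The viscosity value, the velocity gradient and the patch size are hypothetical numbers chosen only for illustration.

```python
def wall_shear_stress(mu, du_dy):
    """Newtonian wall shear stress: tau = mu * du/dy, in Pa."""
    return mu * du_dy

def drag_force(tau_values, cell_area):
    """Approximate F_d = integral of tau over A as a sum over equal-area cells."""
    return sum(tau_values) * cell_area

MU_AIR = 1.8e-5  # Pa*s, assumed dynamic viscosity of air near room temperature

# Hypothetical near-wall gradient: velocity rises 1 m/s over 30 micrometers.
tau = wall_shear_stress(MU_AIR, 1.0 / 30e-6)

# Uniform stress over a 1 cm x 1 cm patch split into 100 cells of 1 mm^2 each.
force = drag_force([tau] * 100, 1e-6)

print(f"tau = {tau:.2f} Pa, drag on patch = {force:.2e} N")
```

A steeper velocity gradient, such as the one produced by the down-wash between vortices, raises tau and therefore the summed drag in direct proportion.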

In  a  turbulent  flow  across  a  surface,  many  random  structures  [3]  form.  These 
structures  are  within  the  boundary  layer,  directly  next  to  the  wall,  and  are  comprised  of 
counter-rotating  vortex  pairs  that  travel  downstream.  [Figure  1]  In  between  these 
vortices,  faster  moving  (or  higher  momentum)  fluid  is  brought  down  closer  to  the  surface. 
This  down-wash  results  in  a  higher  velocity  gradient,  which  results  in  higher  shear  stress 
and ultimately, higher surface drag. By preventing the occurrence of such structures, or by
reducing their effect, it is conceivable that drag reduction is feasible in turbulent boundary
layers.  Typical  flow  conditions  often  yield  structures  which  have  sizes  on  the  order  of 
millimeters  and  lifetimes  on  the  order  of  milliseconds.  Any  system  designed  to  interact 
with  these  vortices  must  have  sensors  and  actuators  of  the  same  size  and  with  similar 
operating  frequencies.  Using  MEMS,  therefore,  seems  like  a  logical  solution. 


Figure  1.  (a)  Perspective  and  (b)  side  views  of  a  counter  rotating  vortex 
pair  where  Xx,  Xy,  and  Xz  refer  to  dimensionless  lengths  (kinematic  viscosity 
divided  by  wall  friction  velocity). 

In  the  past,  much  effort  has  been  expended  on  the  use  of  passive  techniques,  such 
as  the  use  of  riblets  [4-9]  aligned  with  the  flow,  to  reduce  drag  in  turbulent  boundary 
layers.  Drag  reduction  on  the  order  of  8%  [1]  has  been  observed. 

While  passive  drag  reduction  appears  to  be  promising,  it  also  has  some  drawbacks. 
Because  the  vortices  occur  randomly  in  space  and  time,  any  passive  system  with  no 
sensors  or  feedback  must  necessarily  be  turned  “on”  at  all  times.  As  an  example,  riblets 
prevent high momentum fluid from being transferred down to the surface when
counter-rotating vortices pass over them. Immediately, one conclusion can be drawn from this
fact:  since  riblets  are  fixed  in  size,  they  are  apparently  only  effective  when  structures  of  a 
given  size  pass  over  them.  Should  flow  parameters,  and  hence  the  structure  size  change, 
fixed-size  riblets  may  not  be  effective.  Riblets  also  result  in  a  larger  effective  area  over 
which  drag  can  occur.  As  a  consequence,  the  presence  of  riblets  actually  increases  drag 
during  unmatched  flow  conditions. 

Therefore, any system designed to counter the effects of these vortices should be
distributed  and  should  have  localized  control.  The  distribution,  however,  does  not 
necessarily  need  to  be  global.  In  fact,  in  simulations  done  by  Choi  [10],  it  was  shown  that 
by  only  controlling  5%  of  an  area,  15%  drag  reduction  could  be  effected.  A  conceptual 
drawing of a microsensor/microactuator/microelectronics (or M³) unit cell is shown in
Figure  2. 




Figure  2.  Drawing  showing  concept  of  a  single  sensor/actuator/electronic 
flow  interaction  unit. 


The  idea  of  localizing  the  control  becomes  important  because  of  the  large  number 
of distributed units needed. In addition, it probably isn't necessary for any
decision-making unit to know what is happening globally in order to make localized decisions. An
example can be seen in the human nervous system. When a person accidentally touches a
hot object with his hand, it is not the brain that makes the decision to pull away. Local
areas  of  the  central  nervous  system  can  make  the  decision,  known  as  a  reflex,  before  the 
brain  is  even  aware  of  the  situation.  This  is  an  example  where  no  other  information  about 
other  parts  of  the  body  is  needed  before  the  pull-back  decision  is  made.  The  time  saved  in 
not  sending  the  information  to  the  brain  minimizes  the  damage. 

In  fact,  this  is  not  the  only  concept  in  which  a  nature-inspired  solution  proves  to  be 
of  interest.  Much  effort  has  been  spent  studying  drag  reduction  in  animals  [11].  In 
particular  to  viscous  drag  reduction,  many  examples  have  been  found  in  marine  animals. 
Some  involve  the  coverage  of  surfaces  with  various  “slimes”  [12].  Many  fast  swimming 
sharks  [13]  utilize  a  passive  drag  reduction  scheme  with  small  riblets.  Passive  here  refers 
to the fact that the riblets are no more than simple attachments to the scales. No
movement or control scheme is involved.


Figure 3. SEM of riblets on a fast-swimming shark. The length scale is on
the order of 50 µm.

Dolphins  and  porpoises  have  been  the  focus  of  many  studies.  In  1936,  Gray  [14] 
performed  some  simple  observations  and  calculations  that  resulted  in  his  belief  that  the 
drag  associated  with  the  flow  around  a  dolphin  must  be  laminar,  if  not  “better”.  He 
believed  that  the  flow  could  not  be  turbulent  because  the  dolphins  simply  could  not 
generate  the  power  required  to  move  through  the  water  at  the  speeds  he  measured.  Gray 
therefore  postulated  that  dolphins  have  some  means  of  turning  turbulent  flow  laminar.  In 
later  works,  others  have  speculated  that  this  may  be  the  result  of  compliant  skins  or 
perhaps  even  active  control.  Others  [15,16,33],  still,  have  postulated  that  many  of  Gray’s 
assumptions  and  measurement  techniques  were  in  error  and  that  dolphins  do  not  have  any 
such  means  of  reducing  drag.  To  this  day,  no  overwhelming  evidence  has  been  obtained 
proving  either  side  correct.  It  is  very  difficult  to  obtain  accurate  experimental  evidence,  as 
the  very  act  of  observing  can  often  affect  the  dolphins  (i.e.  the  very  presence  of  a  boat  may 
spook  the  dolphins  into  going  underwater  or  the  presence  of  a  moving  boat  may  generate 
wakes  which  affect  the  flow  around  the  dolphins). 


As  a  final  note,  whether  the  micromachined  system  will  be  used  on  aircraft  for 
turbulent  shear  stress  reduction  remains  to  be  seen.  However,  the  fundamental  knowledge 
obtained  from  building  a  system  for  controlling  an  intricate  natural  phenomenon  has 
broad  applications  for  numerous  engineering  and  scientific  challenges. 

Shear  Stress  Imager 

A  new  MEMS  shear  stress  sensor  imager  was  developed  for  use  in  this  project,  and 
its  capability  of  imaging  spatial  surface  shear  stress  distributions  has  been  demonstrated. 
The imager consists of multiple rows of vacuum-insulated shear stress sensors with a 300
µm pitch. This small spacing allows it to detect surface flow patterns that could not be
directly  measured  before.  The  high  frequency  response  (30  kHz)  of  the  sensor  under 
constant  temperature  bias  mode  also  allows  it  to  be  used  in  high  Reynolds  number 
turbulent  flow  studies. 

The  statistical  size  of  the  drag-inducing  vortex  pair  streak  decreases  as  the 
Reynolds number of the flow increases. For a typical airflow of 15 m/s in our wind tunnel
tests, and a Reynolds number of 10⁴, the vortex streaks have a mean width of about 1 mm.
The  length  of  a  typical  vortex  streak  can  be  about  2  cm,  giving  the  streaks  a  20:1  aspect 
ratio.  The  frequency  of  appearance  of  the  streaks  is  approximately  100  Hz.  The  life-time  is 
about  1  ms  [3]. 

Many ways exist to measure wall shear stress [17]. Among
them  is  the  thermal  method,  which  uses  hot  film  sensors  to  determine  the  shear  stress 
indirectly.  This  technique  has  many  advantages  over  other  methods  for  real-time  flow 
measurement and control. For example, it can achieve high sensitivity and high frequency
response while keeping the sensor size small. Traditional hot film sensors are electrically
heated thin metal film resistors on substrates. Since the shear stress is inferred from the
convection of heat, which alters the resistivity of the polysilicon, the thin film resistor
needs to be isolated from the substrate, minimizing the conductive heat loss and thereby
increasing  sensitivity.  In  the  past,  the  only  way  to  partially  solve  the  problem  was  to  use 
low  thermal  conductivity  materials  such  as  quartz  for  the  substrate.  Reasonably  good 
sensitivity  could  be  obtained  only  when  such  sensors  are  used  to  measure  high  thermal 
conductivity  fluid  such  as  water.  However,  they  are  not  sensitive  enough  for  the 
measurement  in  low  thermal  conductivity  fluids  such  as  air.  Moreover,  the  size  of 
traditional  sensors  is  typically  in  the  mm  range  [17].  This  may  be  tolerable  in  measuring 
the  mean  value  of  shear  stress,  but  is  definitely  not  acceptable  in  shear  stress  imaging  with 
reasonable  spatial  resolution. 

Thanks  to  the  development  of  surface  micromachining  technology,  we  can 
optimize  both  the  materials  and  the  structure  of  the  sensors.  Figure  4  shows  the  cross- 
sectional  structures  of  a  few  types  of  the  micromachined  shear  stress  sensors.  Type  I 
features a 2 µm deep vacuum cavity with a 0.25 µm thick polysilicon wire embedded in
the nitride diaphragm. Here the vacuum cavity is designed to thermally isolate the
diaphragm from the substrate [18]. Type II has a similar structure to type I except that the
polysilicon wire is lifted 4 µm above the diaphragm, thus achieving better thermal
isolation.  Type  III  is  a  conventional  polysilicon  bridge  sitting  on  the  solid  substrate  [19]. 
Type  IV  is  basically  a  micromachined  hot  wire  close  to  a  wall  as  has  been  previously 
reported.  The  wire  is  a  few  microns  above  the  substrate  surface  and  is  in  the  linear  velocity 
distribution region so that it measures the wall shear stress instead of velocity [20]. All
four types were fabricated on a single chip to ensure identical thermal and electrical
properties of the sensor materials. Figure 5 shows their calibration results in wind-tunnel
tests. The output changes are proportional to the one-third power of shear stress, which
agrees with the heat transfer theory [17]. Types I and II are clearly the most
sensitive. Moreover, type I has a much simpler fabrication process than type II.
Therefore,  it  was  chosen  as  the  building  block  of  this  generation  of  shear  stress  imaging 
chip  to  be  described  below. 

Figure 4. Structures of the sensors.


Figure  5.  Wind-tunnel  calibration  of  sensors. 
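
The one-third power relationship can be used both to fit a calibration and to invert it. The sketch below is a minimal illustration, assuming the output change follows delta_v = b * tau^(1/3) with a single gain b and no offset. The calibration points and the gain value are synthetic, hypothetical numbers, not the measured data of Figure 5.

```python
def fit_cube_root_gain(tau_pa, delta_v):
    """Least-squares slope b for delta_v = b * tau**(1/3), line through the origin."""
    x = [t ** (1.0 / 3.0) for t in tau_pa]
    return sum(xi * yi for xi, yi in zip(x, delta_v)) / sum(xi * xi for xi in x)

def shear_from_output(delta_v, b):
    """Invert the power law to recover shear stress from an output change."""
    return (delta_v / b) ** 3

# Synthetic calibration points (hypothetical values, not measured data).
tau_cal = [0.1, 0.2, 0.4, 0.8]          # Pa
b_true = 0.05                           # V / Pa^(1/3), assumed gain
dv_cal = [b_true * t ** (1.0 / 3.0) for t in tau_cal]

b_fit = fit_cube_root_gain(tau_cal, dv_cal)

# Recover a shear stress of 0.5 Pa from the output change it would produce.
tau_est = shear_from_output(b_fit * 0.5 ** (1.0 / 3.0), b_fit)

print(f"fitted gain = {b_fit:.4f}, recovered tau = {tau_est:.3f} Pa")
```

A larger fitted gain corresponds to a more sensitive sensor, which is one way the four sensor types could be compared on a common footing.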

After the structure is decided, the geometry of each layer is optimized to give
maximum sensitivity. It is found that the sensitivity is higher for thinner diaphragms and
larger aspect ratio polysilicon resistors (i.e. larger L/W, where L is the length and W the
width). But the diaphragm cannot be too thin as it would otherwise break during
fabrication or operation. L is limited by the size of the whole device, which is at most 300
µm for the application in shear stress imaging. Therefore, L is chosen to be 150 µm. Also,
W is limited by the photolithography and etching technology. In this case, it is designed to
be 3 µm to ensure good uniformity.

Figure 6(a) shows the photomicrograph of an individual sensor while Figure 6(b)
shows the 2.85 cm x 1.0 cm imaging chip using type I shear stress sensors. It is specifically
designed for studies in turbulent flow with the Reynolds number near 10⁴. There are two
identical sensor rows 5 mm apart, parallel to the broad side of the chip. There are also
various test structures, including a 25 sensor test row, on this chip. This vertical pitch is
chosen such that at least four data points can be taken from a streak in the streamwise
direction. Each row has 25 sensors with 300 µm pitch, which is already the minimum for
this type of sensor. It should give at least three data points from a streak in the spanwise
direction and be able to catch more than one streak. The 1 cm spacing between the sensors
and the left and right edges of the chip is necessary to keep the upstream bonding wires
from interfering with the downstream sensors.

Figure 6. (a) SEM of individual sensor and (b) shear stress imaging chip.
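
The sampling argument behind the chip layout can be checked numerically. The sketch below uses the 300 µm pitch, 25 sensors per row and 5 mm row spacing from the text, together with the typical streak dimensions quoted earlier (about 1 mm wide and 2 cm long). Counting samples by integer division is an assumption about how the figures of three and four data points were reached.

```python
# All lengths in micrometers to keep the arithmetic exact.
PITCH_UM = 300             # spanwise sensor pitch
SENSORS_PER_ROW = 25
ROW_SPACING_UM = 5_000     # streamwise spacing between sensor rows
STREAK_WIDTH_UM = 1_000    # typical streak width (about 1 mm)
STREAK_LENGTH_UM = 20_000  # typical streak length (about 2 cm)

# Distance between the first and last sensor of a row.
row_span_um = (SENSORS_PER_ROW - 1) * PITCH_UM

# Whole sensor pitches that fit across one streak.
spanwise_points = STREAK_WIDTH_UM // PITCH_UM

# Whole row spacings that fit along one streak.
streamwise_points = STREAK_LENGTH_UM // ROW_SPACING_UM

# Streak widths that fit side by side under one row.
streak_widths_per_row = row_span_um // STREAK_WIDTH_UM

print(f"row span: {row_span_um / 1000:.1f} mm")
print(f"spanwise points per streak: {spanwise_points}")
print(f"streamwise points per streak: {streamwise_points}")
print(f"streak widths covered by one row: {streak_widths_per_row}")
```

With these numbers a row spans 7.2 mm, which is several streak widths, consistent with the statement that a row can catch more than one streak.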

The fabrication process of the shear stress imager starts with the deposition of
4000 Å of LPCVD low stress silicon nitride. Following lithography, the nitride is
removed by plasma with a little over-etch to give a 7000-8000 Å cavity. Then the wafer is
put in an oxidation furnace to grow 1.7 µm thick oxide in the cavity. After a short etch in
BHF to remove the oxidized nitride, 4000 Å of PSG is deposited, patterned and annealed to
form the sacrificial layer etching channel. Next, 1.2 µm of LPCVD low stress nitride is
deposited as the diaphragm material. Etching holes are opened to expose the end of the
PSG etching channel, followed by a 49% HF etch to completely remove the PSG and
thermal oxide underneath the diaphragm. The cavity is then sealed by LPCVD LTO and
nitride deposition at a vacuum of 200 mTorr. The sealing materials on the diaphragm are
removed by plasma and BHF etching to minimize the diaphragm thickness. A 4000 Å
polysilicon layer is deposited, doped, annealed and patterned to form the resistor on the
diaphragm. Another 2000 Å of low stress nitride is deposited to passivate the resistor. After
the contact hole opening, metallization is done and the wafer is diced into 2.85 cm x 1 cm
chips. The fabrication process flow is shown in Figure 7.

Figure 7. Process flow for shear stress imager.

The  package  for  the  imaging  chip  is  a  fine-line  PC  board  with  a  recess  in  the  center 
so  that  the  imaging  chip  can  be  flush-mounted.  The  chip  and  the  PC  board  are  electrically 
connected  by  wire  bonding.  The  electrical  leads  are  soldered  on  the  backside  of  the  PC 
board.  The  PC  board  is  then  flush-mounted  on  a  specially-made  plug  which  fits  into  the 
wall  of  the  wind-tunnel  (Fig.  8),  with  the  sensor  row  perpendicular  to  the  flow  direction. 


Figure 8. Imaging chip packaging and experimental set-up.

The wind tunnel supplies a two-dimensional channel flow. The channel is 4.87 m
long with a cross-sectional area of 60 cm x 2.5 cm. The walls of the channel are
constructed of 2.5 cm thick Plexiglas and supported by a steel frame. An axial blower
powered by a DC source supplies the air flow in the channel. At the highest blower speed,
the  centerline  velocity  in  the  channel  is  about  25  m/s.  Hot-wire  velocity  measurements  at 
10  m/s  indicate  that  the  channel  consists  of  a  laminar  entrance  flow  region  which  gradually 
transforms  into  a  fully-developed  turbulent  flow  in  the  downstream  2/3  portion  of  the 
channel.  All  calibration  and  testing  of  the  imaging  chip  is  carried  out  in  this  region  of  the 
channel. 

In  our  experiments,  the  sensors  are  biased  in  constant  temperature  (CT)  mode. 
Although  it  is  more  complicated  than  the  constant  current  mode,  it  can  achieve  much 
higher  frequency  bandwidth  which  is  crucial  in  turbulence  measurement.  Therefore,  arrays 
of  CT  circuits  and  gain  stages  have  been  made  on  PC  boards  using  op-amps  and  discrete 
components (Figure 9). The dc offset of the outputs can be adjusted individually, but the
gain is fixed at 10. A computer-controlled data acquisition system is used to measure
all the outputs simultaneously.

Figure 9. Constant temperature biasing circuit, gain stage and temperature
compensation  stage. 

Before  the  sensors  are  used  to  measure  the  shear  stress  distribution,  their  dc  outputs 
are calibrated against known wall shear stress levels which are calculated from the
centerline  velocity  by  using  the  following  relationship  [21], 
where Re is the Reynolds number, Uc is the centerline velocity of the channel, uτ is the
friction velocity, d is the half width of the channel (= 1.27 cm) and is used to calculate Re,
τw is the wall shear stress, and ρf is the density of air.
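
The quantities defined above can be combined as in the following sketch. It assumes that Re is formed from the centerline velocity and the channel half width, and it uses the standard definition of friction velocity, τw = ρf uτ². The ratio uτ/Uc is treated as an input because the empirical correlation from [21] is not part of this code. The value 0.05 and the air properties are placeholders for illustration only, not results of that correlation.

```python
def reynolds_number(u_c, half_width, nu):
    """Re based on centerline velocity and channel half width (assumed definition)."""
    return u_c * half_width / nu

def wall_shear_from_friction_velocity(u_tau, rho):
    """Standard definition of friction velocity rearranged: tau_w = rho * u_tau**2."""
    return rho * u_tau ** 2

NU_AIR = 1.5e-5    # m^2/s, assumed kinematic viscosity of air
RHO_AIR = 1.2      # kg/m^3, assumed density of air
HALF_WIDTH = 0.0127  # m, channel half width d from the text

u_c = 15.0  # m/s, centerline velocity
re = reynolds_number(u_c, HALF_WIDTH, NU_AIR)

# Placeholder ratio u_tau / U_c; in practice it comes from the correlation in [21].
friction_ratio = 0.05
u_tau = friction_ratio * u_c
tau_w = wall_shear_from_friction_velocity(u_tau, RHO_AIR)

print(f"Re = {re:.0f}, u_tau = {u_tau:.2f} m/s, tau_w = {tau_w:.3f} Pa")
```

With the assumed air properties, a centerline velocity of 15 m/s gives a Reynolds number on the order of 10⁴, in line with the flow conditions quoted for the wind tunnel tests.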


Figure 10 shows the calibration results for 10 sensors in a row. Although each
sensor has a different offset, the trends of all curves are almost the same. Polynomial fitting is
performed  on  each  curve  to  extract  the  fitting  parameters  for  later  use  in  real  data 
processing. 


Figure  10.  Calibration  curves  of  10  sensors  in  a  row. 
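
The per-sensor polynomial fit can be sketched as follows. This is a minimal illustration, assuming shear stress is fitted as a quadratic in the dc output voltage and that the fit is solved through the normal equations. The polynomial degree, the fitting direction and the synthetic calibration data are all assumptions, and the helper names `polyfit` and `polyval` are defined here rather than taken from a library.

```python
def polyfit(xs, ys, degree):
    """Least-squares coefficients c[0] + c[1]*x + ... via the normal equations."""
    n = degree + 1
    # Augmented normal-equation matrix [A^T A | A^T y].
    m = [[sum(x ** (i + j) for x in xs) for j in range(n)]
         + [sum(y * x ** i for x, y in zip(xs, ys))] for i in range(n)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [a - f * b for a, b in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]

def polyval(coeffs, x):
    """Evaluate c[0] + c[1]*x + c[2]*x**2 + ..."""
    return sum(c * x ** k for k, c in enumerate(coeffs))

# Synthetic calibration for one sensor (hypothetical numbers).
volts = [0.0, 0.5, 1.0, 1.5, 2.0, 2.5]
true_coeffs = [0.1, 0.3, 0.2]
tau_known = [polyval(true_coeffs, v) for v in volts]

coeffs = polyfit(volts, tau_known, 2)

# Later data processing: convert a measured voltage to shear stress.
tau_at_1p2 = polyval(coeffs, 1.2)
print(f"coefficients = {[round(c, 4) for c in coeffs]}, tau(1.2 V) = {tau_at_1p2:.3f} Pa")
```

Because each sensor has its own offset, one set of coefficients would be stored per sensor and applied to that sensor's samples only.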

It  is  important  to  note  that  the  dc  outputs  of  the  sensors  are  sensitive  to  the  fluid 
temperature  because  the  shear  stress  sensor  is  essentially  a  thermal  sensor  with 
temperature coefficient of resistance around 0.1%/°C. This dc drift can be compensated by
measuring the sensor temperature sensitivity and monitoring the fluid temperature change
using a temperature sensor. Figure 11 shows that an order of magnitude of improvement
in the thermal stability has been achieved by the compensation circuit given in Figure 9.
Here  the  temperature  sensor  is  just  another  shear  stress  sensor  operated  at  very  low  power 
such  that  the  self-heating  is  negligible. 


Figure  11.  Typical  temperature  sensitivities  before  and  after  temperature 
calibration. 
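
The compensation described above is done in hardware by the circuit of Figure 9. The same idea can be expressed in software as a first-order correction, sketched below. The linear drift model, the function name and all numerical values are assumptions for illustration.

```python
def compensate(v_out, t_fluid, t_ref, dv_dt):
    """Remove dc drift by subtracting the measured sensitivity times the temperature change."""
    return v_out - dv_dt * (t_fluid - t_ref)

DV_DT = 0.02   # V per degree C, hypothetical measured temperature sensitivity
T_REF = 22.0   # degree C, fluid temperature at calibration time (assumed)

# Raw output taken after the fluid has warmed by 1.5 degrees C.
v_raw = 1.53
v_corrected = compensate(v_raw, 23.5, T_REF, DV_DT)

print(f"raw = {v_raw:.3f} V, corrected = {v_corrected:.3f} V")
```

The corrected voltage, rather than the raw one, would then be passed through the calibration curve to obtain shear stress.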

To  confirm  that  the  sensors  have  enough  bandwidth  to  pick  up  all  the  information 
in  a  turbulent  flow  investigation,  the  frequency  response  of  a  sensor  in  CT  mode  is 
measured  using  a  sinusoidal  electrical  testing  signal  Vt.  Figure  12  shows  that  this 
bandwidth  reaches  30  kHz,  which  is  sufficient  for  the  turbulent  flow  under  study.  The 
deviation  between  the  experimental  data  and  the  fitted  curve  is  due  to  the  fact  that  a  real 
sensor  has  multiple  thermal  time  constants. 


Figure  12.  Frequency  response  of  a  CT  sensor.  Derivation  of  the 
theoretical  curve  is  shown  in  [22]. 

Magnetic  Actuator 

Many  generations  of  active  magnetic  flaps  were  designed  and  fabricated  for  use  in 
the  drag  reduction  project.  The  flap  structure  was  chosen  because  it  can  provide  relatively 
large displacements at high frequencies. Magnetic actuation was chosen due to the long
range  over  which  the  force  can  be  exerted  and  the  fast  manner  in  which  the  force  can  be 
felt.  The  first  generation  was  fabricated  as  a  proof  of  concept  and  to  validate  our  ideas  that 
the  use  of  magnetic  forces  was  a  viable  means  to  actuate  micromachined  devices.  Later 
generations  were  designed  and  fabricated  to  provide  more  robust  devices  (here,  the  term 
robust  implies  both  actual  device  robustness  as  well  as  process  robustness)  with  the 
eventual  goal  of  integration  in  mind.  Sometimes,  a  new  generation  would  be  fabricated 
because  our  fluid  mechanics  partners  believed  that  a  particular  geometry  would  prove  to 
be  advantageous. 


The  first  generation  [XXX]  flap  consisted  of  three  layers  (see  Figures  13  and  14). 
A  structural  layer  of  silicon  nitride  was  sandwiched  between  a  top  layer  of  metal  coils  and 
a bottom layer of heavily phosphorus-doped polysilicon. The polysilicon was used as a
return path for the current. Several different geometries were fabricated. Plates of
different sizes and shapes were designed. Some flaps had four corner supports instead of
two bending beams. Different etch hole sizes and placements were also designed.


Fig  13.  SEM  of  a  first  generation  flap. 

Figure 14. Typical (a) deflection and (b) resonant frequency curves for
the  first  generation  actuator. 

Unfortunately,  the  use  of  polysilicon  as  the  return  path  for  current  was  the  root  of 
two  major  problems:  bending  due  to  stress  and  excessive  plate  heating.  The  mismatch  of 
the  stresses  of  the  polysilicon  and  silicon  nitride  films  caused  the  flaps  to  curl  up 
considerably.  In  general,  this  would  cause  problems  in  the  sense  that  there  would  be  a 
flow disturbance at all times. For the first generation, however, this was actually a
serendipitous event because the stress served to provide a force to oppose the stiction
forces.  Without  the  natural  bending,  the  flaps  would  have  all  stuck  to  the  substrate. 

Polysilicon  has  a  resistivity  that  is  orders  of  magnitude  higher  than  that  of  metals. 
In  fact,  in  IC  processes,  polysilicon  is  used  as  a  resistor  material.  For  our  purposes,  having 
a  high  resistance  is  detrimental.  First  of  all,  keeping  in  mind  that  the  eventual  goal  is  a 
distributed  system  with  thousands  of  devices,  the  power  consumption  for  each  actuator 
must  be  minimized.  Second,  the  heat  generated  induces  a  thermal  bending  of  the  plate  that 
may  not  be  desirable.  Finally,  the  generation  of  heat  may,  in  fact,  affect  the  flow  in  a 
negative  way  by  adding  energy  to  an  already  turbulent  flow. 

Therefore,  the  second  generation  [XXX]  was  designed  and  fabricated  with  the 
intent  of  reducing  the  power  consumption  and  also  reducing  the  bending  due  to  intrinsic 
stresses.  As  previously  mentioned,  the  use  of  polysilicon  is  the  cause  of  both  problems. 
All  second  generation  devices  [Figure  15]  are  characterized  by  the  use  of  PSG  as  the 
sacrificial  layer  while  using  two  layers  of  metal  for  the  coils. 


Figure  15.  Second  generation  magnetic  actuator. 


Typically,  the  IC  industry  uses  some  sort  of  LPCVD  LTO  as  metal  insulation. 
Immediately,  one  problem  with  using  LTO  in  this  process  becomes  apparent.  Oxide  is 
also  used  as  the  sacrificial  layer.  Therefore,  the  metal  insulation  would  also  be  etched 
away,  leaving  behind  air  bridges.  Other  LPCVD  options  were  not  available  to  us  due  to 
temperature  considerations.  The  low  deposition  temperature  of  LTO  is  important  because 
the  metal  must  not  be  melted  or  evaporated  during  the  deposition.  Even  though  the 
melting  points  of  chrome  and  gold  are  significantly  higher  than  that  of  aluminum,  neither 
metal  can  stand  LPCVD  silicon  nitride  temperatures. 

One simple alternative to using LTO as the insulation is to use photoresist, which
can be spun on in a room temperature process. Photoresist is attacked by highly
concentrated HF acid, but it tends to break off in chunks as opposed to being dissolved
away (as oxide is). Therefore, the possibility exists that while the resist integrity would be
compromised, enough may remain to support the top layer of metal. Another alternative is
to etch using buffered HF acid, which attacks oxide much more slowly than highly
concentrated HF acid does, but does not appreciably attack resist. Results of attempting
these options are discussed below.

An  alternative  to  LPCVD  films  for  insulation  would  be  PECVD  films  [Figure  16]. 
Our lab has the capability of depositing PECVD silicon nitride at a temperature of 300 °C.
Due  to  the  nature  of  their  deposition,  PECVD  films  generally  have  poorer  chemical 
resistance  than  LPCVD  films.  Therefore,  PECVD  nitride  is  appreciably  etched  in  HF  acid. 
However,  if  the  flap  can  be  freed  before  the  insulating  nitride  is  completely  etched  away, 
this  process  could  also  provide  a  viable  means  to  insulate  the  coils. 


Figure 16. SEM of 2nd generation flap using PECVD nitride as the ILD.

The  third  generation  of  actuators  was  conceived  with  the  idea  of  eventual  process 
integration  in  mind.  Up  to  this  point,  all  of  the  actuators  used  PSG  as  the  sacrificial  layer. 
Unfortunately, as previously mentioned, IC processes typically use oxide both as
inter-metal dielectrics and for overall circuit encapsulation (often called an overglass
layer). Since the release step is usually the final step in any process involving the flap, the
HF  acid  would  horribly  attack  all  of  the  oxide  required  for  the  electronics  portion  of  the 
chip.  Therefore,  a  different  sacrificial  layer  process  needed  to  be  developed. 


Figure  17.  A  low  temperature  actuator. 


For  integration  reasons,  metal  was  one  option  explored  for  use  as  a  sacrificial  layer. 
Metal  can  be  deposited  in  thick  layers  relatively  easily  and  at  low  temperatures.  When 
using  a  metal  sacrificial  layer,  care  must  be  taken  to  choose  a  structural  layer  that  can  also 
be  deposited  at  a  low  temperature.  Therefore,  using  a  metal  sacrificial  layer  in  conjunction 
with  an  LPCVD  oxide  structural  layer  was  investigated.  Two  choices  were  considered  for 
the  sacrificial  layer:  aluminum  and  copper.  An  actuator  fabricated  using  copper  as  the 
sacrificial  layer  is  shown  in  Figure  17. 

The  fourth  generation  of  flaps  [Figure  18]  was  designed  with  the  ideas  in  mind  that 
(a)  a  combined  passive/active  flap  may  prove  beneficial,  and  (b)  the  passive  component, 
permalloy,  could  also  be  used  to  aid  in  a  magnetic  release  step.  This  generation  of  flap 
was  designed  with  the  idea  in  mind  of  IC  process  compatibility.  Therefore,  some  versions 
of  these  flaps  used  polysilicon  as  the  sacrificial  layer.  One  particularly  nice  aspect  of  using 
polysilicon  as  the  sacrificial  layer  is  the  possibility  of  releasing  structures  using  a  dry,  gas- 
phase  etchant. 


Figure  18.  A  hybrid  flap. 


Finally,  a  fifth  generation  was  designed  [Figure  19].  This  generation  is  completely 
different  from  the  previous  four  in  the  sense  that  it  uses  a  bulk,  rather  than  surface, 
micromachined  process.  The  primary  reason  for  this  is  the  desire  for  thicker  beams,  which 
result  in  a  higher  resonant  frequency,  than  possible  with  a  surface  micromachined  process. 


Figure  19.  A  bulk  micromachined  flap. 


Shear  Stress  Measurements  and  Control 


Single  Point  Surface  Shear  Stress  Measurement 


For real-time shear stress imaging, the output voltage is sampled at 10 kHz and
converted  to  shear  stress  signal  based  on  the  calibration  performed  previously.  In  order  to 
establish  the  credibility  of  the  imaging  chip,  the  turbulence  statistics  calculated  from  the 
shear  stress  fluctuations  recorded  by  a  single  shear  stress  sensor  are  compared  to 
previously  established  results.  Figure  20  shows  the  comparison  in  terms  of  the  normalized 
RMS  level,  the  skewness  factor,  and  the  flatness  factor.  It  is  obvious  that  the  present 
results  agree  very  well  with  previous  studies  in  all  three  areas  [23-25].  In  addition,  the 
present statistics appear to be independent of the Reynolds number, as predicted by
turbulence theory.
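
As a rough illustration of the statistics compared in Figure 20, the sketch below computes the normalized RMS level, the skewness factor, and the flatness factor from a sampled shear stress record. The function name, the synthetic Gaussian test record, and the choice to normalize the RMS level by the mean shear stress are assumptions made for illustration; they are not taken from the measurements.

```python
import numpy as np

def shear_stress_statistics(tau):
    """Return (rms/mean, skewness, flatness) of a sampled shear stress record."""
    tau = np.asarray(tau, dtype=float)
    mean = tau.mean()
    fluct = tau - mean                       # fluctuating part of the signal
    rms = np.sqrt(np.mean(fluct ** 2))
    skewness = np.mean(fluct ** 3) / rms ** 3
    flatness = np.mean(fluct ** 4) / rms ** 4
    return rms / mean, skewness, flatness

# Hypothetical record: 1 s of data sampled at 10 kHz (synthetic, Gaussian).
rng = np.random.default_rng(0)
tau = 1.0 + 0.3 * rng.standard_normal(10_000)
rms_norm, skew, flat = shear_stress_statistics(tau)
```

For a Gaussian record the skewness is close to 0 and the flatness close to 3; departures from those values in a measured record reflect the intermittent, asymmetric nature of wall shear stress fluctuations.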


Figure  20.  Turbulence  statistics  measured  by  a  single  micro  shear-stress 
sensor. 

As  can  be  seen,  the  turbulence  statistics  measured  by  the  micro  sensor  compare 
very  well  with  both  previous  experimental  and  numerical  results. 

Distributed  Measurement 


Next,  the  instantaneous  turbulent  shear  stress  distributions  of  the  channel  were 
recorded by using one of the sensor rows on the imaging chip. Figure 21 shows the


contour  plots  of  the  instantaneous  shear  stress  distributions  at  two  different  centerline 
velocities.  The  white  streaky  structures  in  the  plots  represent  regions  of  high  shear  stress 
on  the  wall  of  the  channel  where  the  imaging  chip  is  located.  They  are  caused  by  the 
presence  of  near-wall  streamwise  vortices  which  bring  high  momentum  fluid  from  the 
freestream to the wall. Due to the small-scale nature of these structures, previous
experiments in turbulent boundary layers have only succeeded in qualitatively
demonstrating their existence without obtaining any quantitative information. This is the
first time that the instantaneous shear stress levels associated with the near-wall structures
have been recorded.


[Figure 21 axes: spanwise direction, 7.5 mm (25 sensors) per panel.]


Figure  21.  Contour  plots  of  the  pseudo  2-D  shear  stress  distributions.  Red 
area  indicates  high  shear  stress  and  blue  area  indicates  low  stress. 

The contour plots in Figure 21 indicate that the scales of the streaks are different at
different centerline velocities. The streaks in the high-speed (20 m/s) case appear to be
thinner and more densely packed than those in the low-speed (8 m/s) case. Similar
phenomena have also been observed in previous experiments. Based on real-time movies
generated from contour plots similar to the ones in Figure 21, the average streamwise
length, the average spanwise scale, and the average spanwise spacing of the streaks at
three different centerline velocities are estimated and shown in Figure 22. Once again, the
present results agree well with previous studies. The average length of the present streaks
falls within the upper and lower bounds of the established results. The average spanwise
scale and spacing of the present streaks are either on or slightly higher than the upper
bound.

Figure 22. Scales of the near-wall streaky structures at different Reynolds
numbers. (a) The length of streaks. (b) The spanwise scale of streaks. (c)
The spanwise spacing of streaks.


Edge  Detector  for  Detecting  High  Shear-Stress  Regions 


The  high-shear  stress  streaks  are  randomly  distributed.  A  real-time  detection 
scheme  is  needed  to  properly  identify  the  streaks  among  the  background  fluctuations  so 
that  the  downstream  actuator  can  be  activated.  A  neural  network  based  detection  circuit 
(described  later)  was  developed  for  this  purpose.  In  this  circuit,  the  signal  output  of  a 
sensor  is  compared  with  its  neighboring  sensors  through  a  filter-threshold  combination. 
Whenever  a  sudden  change  is  detected,  the  boundary  of  the  high  shear  stress  region  is 
marked.  This  circuit  has  been  used  in  conjunction  with  the  imaging  chip  and  a  typical 
result  is  shown  in  Figure  23.  The  light  region  in  the  control  output  (Fig.  23-b)  represents 
‘positive’  identification  of  high  shear-stress  streaks.  The  good  matching  between  the  light- 
color  region  and  the  high  shear-stress  area  (light  gray  region)  measured  by  the  imaging 
chip  indicates  the  effectiveness  of  the  detection  circuit. 


Figure 23. Contours of (a) the instantaneous surface shear stress and (b) the
output of the control circuits.


Interaction  between  Micromachined  Actuator  and  Streamwise  Vortices 


To  properly  control  the  high  shear-stress  streaks  with  micro  actuators  [26,27],  an 
in-depth  understanding  of  the  interaction  between  the  actuators  and  the  flow  structures  is 
required.  Experiments  have  been  carried  out  to  investigate  the  interaction  between  a 
single high shear-stress streak and a micromachined actuator. In this study, a 1.3 mm thick
vortex generator is used to generate a longitudinal vortex pair in a laminar boundary layer.
The micromachined actuator is a silicon flap with 30 turns of copper coil that
matches the physical size of the longitudinal vortex pair. Oscillation of the flap is
achieved by the combination of an external permanent magnet and an AC coil current.
The maximum deflection of the flap is about 30 degrees, which corresponds to a height of 2
mm from the tip of the actuator flap to the wall. The maximum oscillation frequency is
about 100 Hz. The actuator is placed downstream from the vortex generator where the
longitudinal vortex pair is generated. Experiments are carried out for different
combinations of actuator frequency ($\omega$) and maximum tip height ($d$). For each
combination, a two-component (streamwise and spanwise) hot-wire anemometer is used to
measure the velocity distributions downstream from the actuator. From the velocity
distributions, the vorticity and surface shear stress distributions are calculated and phase
averaged.

The  longitudinal  vortices  generated  by  the  vortex  generator  convect  high-speed 
fluid  to  the  wall,  inducing  a  local  high  shear-stress  streak  (Figure  24)  on  the  surface.  As 
the  actuator  oscillates,  it  generates  perturbations  in  the  flow  which  change  the  shear  stress 
intensity  of  the  streak.  As  indicated  by  the  ensemble-average  result  shown  in  Figure  24, 
the  intensity  of  the  streak  decreases  as  the  flap  actuator  deflects  away  from  the  surface, 
reaching a minimum at the maximum actuator tip height (phase = $\pi$). As the actuator
moves back to the surface, the streak intensity returns to its original level. To evaluate
the net effect of the actuator oscillation on the shear stress distribution, the net shear stress
coefficient, $C_{dn}$, defined as

$$C_{dn} = \int_{0}^{2\pi} \left[ C_d(\theta) - C_{d,vg} \right] d\theta ,$$

is evaluated for different $\omega$-$d$ combinations and shown in Figure 25. The $C_{d,vg}$ in the
equation is the time-invariant shear stress coefficient associated with the stationary high
shear-stress streak induced by the vortex generator. Defined this way, $C_{dn}$ indicates a
shear stress increase if positive and a shear stress decrease if negative. As shown in Figure 25,
higher drag reduction is achieved at higher $\omega$ and higher $d$. In addition, similar shear stress
reduction results if the product of $\omega$ and $d$ is the same. Since the product of $\omega$ and $d$
is a measure of the transverse velocity of the actuator flap, this result indicates that
the amount of shear stress reduction is directly related to the transport of high-speed fluid
away from the surface by the vertical motion of the actuator.
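
The net shear stress coefficient can be estimated numerically from phase-averaged data. The sketch below is a minimal illustration that assumes the integral is taken over one full oscillation cycle (phase 0 to $2\pi$) and uses a simple trapezoidal rule; the function name and the cosine-shaped test data are hypothetical and do not come from the experiments.

```python
import numpy as np

def net_shear_stress_coefficient(theta, cd, cd_vg):
    """Trapezoidal estimate of the integral of [C_d(theta) - C_d,vg] over theta."""
    theta = np.asarray(theta, dtype=float)
    excess = np.asarray(cd, dtype=float) - cd_vg
    return float(np.sum(0.5 * (excess[1:] + excess[:-1]) * np.diff(theta)))

# Hypothetical phase-averaged data: streak intensity dips to a minimum at phase pi.
theta = np.linspace(0.0, 2.0 * np.pi, 361)
cd_vg = 1.0                                  # stationary-streak level (made up)
cd = cd_vg - 0.1 * (1.0 - np.cos(theta))     # equals cd_vg at phase 0 and 2*pi
c_dn = net_shear_stress_coefficient(theta, cd, cd_vg)
```

A negative result, as in this synthetic case, corresponds to a net reduction in shear stress over the cycle.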


Figure 24. Contours of ensemble-averaged dU/dy (surface shear stress map). (a) Vortex generator
only; (b) vortex generator and an oscillating micro actuator.


Figure 25. Variation of the shear stress coefficient, $C_{dn}$, with actuator
frequency, $\omega$, and maximum actuator tip height, $d$. Each line is the result of
one $d$, as indicated at the end of the line. The solid ○ markers correspond
to an $\omega d$ of 80 and the solid □ markers correspond to an $\omega d$ of 100.

Integration 

As  shown  above,  the  results  of  sensor/flow  and  actuator/flow  interaction  are  very 
promising. The next step, then, is to integrate sensors, actuators, and electronics on the
same substrate to form the system. Towards this goal, an integration chip was
fabricated but has not yet been extensively tested [28]. There are many generic concerns
in MEMS-Electronics integration that have been addressed in this work. Elements of this
MEMS-Electronics integration effort can be applied to other, more generic integration
efforts.

The primary concern of all integration efforts revolves around the choice of when
the MEMS processing steps are completed relative to the IC processing steps. There are
three options: (1) an interweaved process, (2) electronics first, followed by MEMS, and (3)
MEMS first, followed by electronics. Each alternative has its advantages and
disadvantages.

One  assumption  being  made  while  comparing  these  different  options  is  that  the 
MEMS  designer  does  not  have  full  and  high  priority  access  to  an  IC  capable  fabrication 
facility.  This  is  a  reasonable  assumption  for  an  academic  environment  where  few 
laboratories  even  have  basic  MEMS  capabilities,  let  alone  IC  fabrication  facilities.  In  the 
few  institutions  (Berkeley  and  Stanford  to  name  two)  in  the  entire  country  which  have 
CMOS  capabilities,  the  electronics  yield  is  usually  unacceptably  low  compared  with 
industry.  The  lack  of  dedicated  access  to  a  fabrication  line  is  not  limited  to  academia. 
Even  in  a  corporation  with  large  fabrication  facilities,  internal  MEMS  research  facilities 
often  are  completely  separate  from  VLSI  lines  and  MEMS  researchers  often  are  not 
allowed  full  access  to  such  lines.  If  full  access  were  available  to  MEMS  researchers,  the 
interweaved process would almost definitely be the choice for reasons that will be
discussed below.


From  a  device-robustness  point  of  view,  the  interweaved  process  is  ideal.  The 
order  in  which  steps  are  completed  reflects  a  process  flow  designed  with  optimum  device 
performance  in  mind.  This  is  in  stark  contrast  to  the  other  approaches,  which  often  require 
steps  that  are  not  needed  for  the  eventual  devices  (or  electronics)  per  se,  but  rather  are 
crucial  for  the  overall  survival  of  the  process.  For  example,  in  an  electronics-first  process, 
during  the  MEMS  processing,  layers  are  often  deposited,  patterned,  and  later  removed 
simply  to  protect  the  aluminum  metalization  on  the  electronics  portion  of  the  wafer.  In  an 
interweaved  process,  the  metalization  steps  could  occur  after  any  harsh  MEMS  processing 
steps  have  been  completed.  From  this,  it  can  be  seen  that  the  interweaved  process  also  is 
usually  the  shortest  (both  in  time  and  in  number  of  processing  steps).  General  processing 
wisdom  holds  that  the  fewer  the  steps,  the  more  robust  the  process.  For  commercial 
applications, fewer steps also reduce the cost significantly while increasing the yield.

Extremely  few  IC  facilities  will  allow  pre-processed  wafers  to  enter  their 
fabrication  lines  due  to  fear  of  contamination,  especially  the  fear  of  unknown  (by  VLSI 
standards)  MEMS  materials.  Therefore,  finishing  the  IC  processing  first  has  one  primary 
advantage:  one  has  many  choices  of  IC  foundries  that  can  provide  wafer  level  electronics. 
After  the  foundry  fabricates  the  IC  electronics,  the  wafers  are  then  subject  to  MEMS 
processing,  which  can  be  completed  in  a  facility  separate  from  the  VLSI  fabrication  line. 

Oftentimes,  the  choice  of  which  approach  to  take  for  integration  depends  on 
metalization  concerns.  Typically,  VLSI  fabrication  uses  aluminum  metalization. 
Aluminum  is  considered  a  low  temperature  material  and  cannot  withstand  processing 
temperatures above 450 °C. Therefore, certain steps such as diffusion and oxidation
cannot be attempted after aluminum deposition. If electronics including metalization is
completed before MEMS processing, the user then can only use low temperature steps to
finish the process. Other considerations regarding metalization are that (1) ideally only
one deposition/pattern/etch step should be used for each metal layer, and (2) fine line
widths are often required for the electronics portion of the chip. These two concerns may
imply  that  the  intelligent  approach  is  to  complete  all  the  metalization  (for  a  given  layer  of 
metal)  at  one  time  and  to  do  it  at  the  VLSI  fabrication  facility,  where  fine  line  metal 
patterning  and  etching  are  well  characterized  steps.  There  may  exist  some  facilities  which 
will  take  pre-processed  wafers  on  a  one-time  basis,  in  which  case  MEMS  processing  up  to 
but  not  including  metalization  can  be  done  before  submitting  the  wafers  to  the  foundry. 
Complete  CMOS  processing  is  then  completed,  with  the  metalization  connecting  both 
electronic  and  MEMS  devices. 

Finally,  although  less  interesting  from  a  research  point  of  view,  economic  reasons 
can  often  dictate  which  option  is  chosen.  When  choosing  an  outside  foundry  as  a  vendor, 
it  should  be  noted  that  the  least  expensive  option  involves  requesting  as  standard  a 
package  as  possible.  This  usually  implies  that,  of  the  three  above-mentioned  options,  the 
second  (electronics  first,  then  MEMS)  option  may  be  the  most  viable.  Cost,  however,  is 
often not the only economic concern. The idea of interweaving a process is often the most
appealing option from a technological point of view. From a foundry's point of view, it is
also  the  most  costly.  While  most  foundries  will  not  turn  down  business  that  requires 
standard  technology,  they  do  require  financial  justification  before  attempting  an  expensive 
custom  run  which  could  divert  their  manpower  and  resources  from  other  high-volume 
projects.  Foundries  such  as  Standard  Microsystems  Corporation  (SMC)  which  have 
dedicated MEMS-Electronics fabrication facilities often also have long lines of corporate
customers buying high volumes of devices. They often are not interested in low volume
applications (such as university research or small company products).

Therefore,  oftentimes,  small  volume  applications  are  limited  to  pursuing  the 
electronics  first  approach.  If  high  temperature  MEMS  steps  are  required,  it  is  possible  to 
switch  to  an  interweaved  approach  where  the  electronics  processing  up  to,  but  not 
including,  metalization  is  completed  at  the  VLSI  foundry.  Next  come  the  MEMS  steps, 
and  the  process  is  completed  with  metalization  at  the  MEMS  facility  (or  another  facility 
willing  to  take  pre-processed  wafers).  This  approach  requires  the  ability  to  dry  etch 
contact  holes,  pattern  fine  lines,  and  dry  etch  aluminum  in  the  post-IC  facility.  This  may 
or  may  not  be  a  limitation.  This  is  the  process  we  chose  to  complete  our  first  attempt  at 
integration. A picture of a completed chip is shown in Figure 26, and an abbreviated
process flow is shown in Figure 27.

Figure 26. Picture of a fabricated integration chip.

Figure 27. Abbreviated integration process flow.

Control 

A  new  adaptive  controller  based  on  a  neural  network  was  constructed  and  applied 
to  our  system  of  drag  reduction.  A  simple  control  network,  which  employs  blowing  and 
suction  at  the  wall  based  only  on  the  wall-shear  stresses  in  the  spanwise  direction,  was 
shown  to  reduce  the  skin  friction  by  as  much  as  20%  in  direct  numerical  simulations  of  a 
low-Reynolds  number  turbulent  channel  flow.  Also,  a  stable  pattern  was  observed  in  the 
distribution  of  weights  associated  with  the  neural  network.  This  allowed  us  to  derive  a 
simple control scheme that produced the same amount of drag reduction. This simple
control scheme generates optimum wall blowing and suction proportional to a local sum of
the wall-shear stress in the spanwise direction. The distribution of corresponding weights
is simple and localized, thus making real implementation relatively easy. Turbulence
characteristics and relevant practical issues are also discussed.

Although a neural network generally requires no prior knowledge of the system (or
“plant”), knowledge about the near-wall turbulence structures provides a guideline for the
design of the network architecture. Initially, $\partial u/\partial y$ and $\partial w/\partial y$ at the wall at several
instances of time were used as input data fields and the actuation at the wall was used for
the output data of the network. Experimentally, we found that only $\partial w/\partial y$ at the wall from
the current instance of time was necessary for sufficient network performance. Because
we  wanted  the  output  to  be  based  only  on  a  local  input  area,  we  designed  our  network 
using  shared  weights.  The  network  had  a  single  set  of  weights  (a  template)  that  is 
convolved  over  the  entire  input  space  to  generate  output  values;  that  is,  we  used  the  same 
set  of  weights  for  each  data  point  and  the  training  involves  iterating  over  all  data  points. 
The  template  extracts  spatially  invariant  correlations  between  input  and  output  data.  The 
size  of  the  template  was  initially  chosen  to  include  information  about  a  single  streak  and 
streamwise  vortex,  and  then  was  varied  to  find  an  optimal  size. 

We  used  a  standard  two-layer  feedforward  network  with  hyperbolic  tangent  hidden 
units  and  a  linear  output  unit  (see  Figure  28).  The  functional  form  of  our  final  neural 
network  is: 


$$v_{jk} = W_a \tanh\left( \sum_{i=-(N-1)/2}^{(N-1)/2} W_i \left. \frac{\partial w}{\partial y} \right|_{j,k+i} - W_b \right) - W_c , \quad 1 \le j \le N_x \ \text{and} \ 1 \le k \le N_z \qquad (1)$$

where the $W$'s denote weights, $N$ is the total number of input weights, and the subscripts
$j$ and $k$ denote the numerical grid point at the wall in the streamwise and spanwise
directions respectively. $N_x$ and $N_z$ are the number of computational domain grid points
in each direction. The summation is done over the spanwise direction. Seven neighboring
points ($N = 7$), including the point of interest, in the spanwise direction (corresponding to
approximately 90 wall units with our numerical resolution) were found to provide enough
information to adequately train and control the near-wall structures responsible for the
high skin friction. Note that the blowing and suction are applied at each grid location
according to the above equation as a numerical approximation of distributed blowing and
suction on the surface. A scaled conjugate gradient learning algorithm [29] was used to

produce rapid training. For given pairs of $(v_{jk}^{des}, \partial w/\partial y|_{jk})$, the network was trained to
minimize the sum of a weighted-squared error given by

$$\text{Error} = \frac{1}{2} \sum_j \sum_k e^{\lambda |v_{jk}^{des}|} \left( v_{jk}^{des} - v_{jk}^{net} \right)^2 \qquad (2)$$

where $v^{des}$ is the desired output value and $v^{net}$ is the network output value given by
equation (1). The weights were initialized with a set of random numbers. Note that the
error defined in (2) exponentially emphasizes (proportional to $\lambda$) large actuations. This
error scaling was chosen based on Choi et al.'s [10] observation that large actuations are
more important for drag reduction. Usually within 100 training epochs, the error reached
its asymptotic limit.

Figure 28. Neural Network Architecture
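
The shared-weight network and its training error can be sketched in a few lines. The code below is only an illustration under stated assumptions: it takes the network to be a single hyperbolic tangent unit applied to a weighted spanwise sum of $\partial w/\partial y$ followed by a linear output, treats the spanwise direction as periodic, and weights the squared error by $\exp(\lambda |v^{des}|)$. The function and parameter names (`network_output`, `weighted_error`, `w_a`, `w_b`, `w_c`, `lam`) are hypothetical, and training itself (for example by scaled conjugate gradient) is not shown.

```python
import numpy as np

def network_output(dwdy, w, w_a, w_b, w_c):
    """Shared-weight template: v[j, k] = w_a * tanh(sum_i w[i] * dwdy[j, k+i] - w_b) - w_c.

    dwdy has shape (Nx, Nz); w holds an odd number N of template weights.
    The spanwise direction (axis 1) is treated as periodic (an assumption).
    """
    dwdy = np.asarray(dwdy, dtype=float)
    w = np.asarray(w, dtype=float)
    half = (len(w) - 1) // 2
    s = np.zeros_like(dwdy)
    for i, w_i in zip(range(-half, half + 1), w):
        # np.roll(..., -i) places dwdy[j, k+i] at position [j, k]
        s += w_i * np.roll(dwdy, -i, axis=1)
    return w_a * np.tanh(s - w_b) - w_c

def weighted_error(v_des, v_net, lam):
    """Squared error weighted by exp(lam * |v_des|) to emphasize large actuations."""
    v_des = np.asarray(v_des, dtype=float)
    v_net = np.asarray(v_net, dtype=float)
    return 0.5 * float(np.sum(np.exp(lam * np.abs(v_des)) * (v_des - v_net) ** 2))

# Hypothetical usage: a random wall field and seven random initial weights.
rng = np.random.default_rng(1)
dwdy = rng.standard_normal((4, 16))
weights = 0.1 * rng.standard_normal(7)
v = network_output(dwdy, weights, w_a=1.0, w_b=0.0, w_c=0.0)
```

Because the same seven weights are applied at every grid point, the number of trainable parameters stays small regardless of the size of the computational domain.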

The computed flow fields for a no-control case and a successful control case, based
on a 7-point weighted sum of $\partial w/\partial y|_w$ [30], were examined to investigate the mechanism
by which the drag reduction is achieved. The most salient feature of the controlled case
was that the strength of the near-wall streamwise vortices was drastically reduced. In
Figure 29, contours of streamwise vorticity in a cross plane are shown. This result further
substantiates the notion that a successful suppression of the near-wall streamwise vortices
leads to a significant reduction in drag. Note that for the controlled case the wall
actuations were applied at both walls. Also, Figure 29(c) shows the wall shear stress for
various control mechanisms.


the same for both figures. Negative contours are chain-dotted. (c) Mean
wall-shear  stress  histories  for  on-line  control  with  different  input  template 
sizes:  the  top  solid  line  is  no  control;  the  dotted  line  is  control  with  5x1; 
the  lighter  bottom  line  is  control  with  7x1;  the  darker  bottom  line  is 
control  with  9x1. 

The probability-density function of the wall-shear stress in the streamwise
direction is shown in Figure 30. It is evident that the control is very effective in
suppressing large fluctuations, thus reducing the mean skin friction. Furthermore, the
root-mean-square (rms) values of turbulent fluctuations in the wall region are also reduced
as shown in Figure 31(a). The same trend was observed by Choi et al. [10]. The rms
vorticity fluctuations are also significantly reduced, except for $\omega_x$ very close to the wall.
The increase for the latter is caused by additional $\partial v/\partial z$ due to the wall actuation. The rms
fluctuations of $\omega_z$ at the wall, which are mainly due to $\partial u/\partial y$ at the wall, are also decreased.
This indicates that the control scheme led to a reduction in the mean shear and its variance
at the wall by suppressing large fluctuations as shown in Figure 30. The reduction in the
rms fluctuations in $\omega_x$ and $\omega_y$ indicates that the control scheme indeed reduced the
strength of the near-wall streamwise vortices and the wall-layer streaks.


Figure  30.  Probability-density  function  of  the  wall-shear  stress:  the  darker 
line  is  no  control;  the  lighter  line  is  control  with  7  fixed  weights.  The  area 
under  each  curve  is  normalized  to  one. 




Figure 31. Root-mean-square fluctuations normalized by the wall
variables: solid line is no control; dotted line is control with 7 fixed
weights. (a) Velocity fluctuations; (b) vorticity fluctuations.


Figure 32 compares the distribution of wall actuation used in our control with that
from Choi et al.'s [10] v-control using the information at $y^+ = 10$ for the same wall-shear
stress distribution. The corresponding wall-shear stress distribution is also shown in
Figure 32. The wall actuations show strikingly similar distributions, even
though the wall actuation of our control is based only on the wall-shear stress $\partial w/\partial y|_w$.

Basically, our control scheme detects edges of locally high-shear stress regions by
measuring the spanwise variation of $\partial w/\partial y$, and applies appropriate wall actuation as
shown in Figures 32(a) and 32(b). Since high-shear stress regions are usually elongated in
the streamwise direction, only spanwise variation is necessary for detecting the edges.
Since $\partial w/\partial y$ is a direct measurement of streamwise vortices, it provides more appropriate
information than $\partial u/\partial y$. This is consistent with our finding that $\partial w/\partial y$ at several points
in the spanwise direction is enough for good control performance.


Figure 32. Contours of the wall actuation: (a) control using $\partial w/\partial y|_w$ with
7 fixed weights; (b) control using information at $y^+ = 10$. Negative
contours are chain-dotted.


Electronics 

Circuits process the signals from the sensors to find regions of high shear stress.
This detection process uses information about the spatial and temporal nature of the
streaks. [...] templates for streak detection. We organize the sensor outputs into thin feature detectors
oriented in the direction of the airflow. When several sensors in a column register either a
larger or smaller output than their neighbors in a spanwise direction, this difference
accumulates. If this accumulated difference exceeds a threshold, a vortex pair streak may
be present in that column. The appropriate control action raises the associated actuator.

Figure 33 shows a plot of the detection and control chip. The CT output sensor
signal feeds into a further stage of amplification. A buffer distributes the amplified signal
to a non-linear resistive network composed of the HRES circuits [31]. Sensor outputs in
the same column and sensor outputs in adjacent columns use different spatial filtering
constants. The different constants serve to reinforce activity within a column and
discourage activity between adjacent columns. The filtered signals feed to a symmetric
anti-bump circuit [32]. The circuit's operation mimics a soft comparator with an adjustable
dead zone. The function of the circuit is to indicate when a particular column has
registered a large shear stress value while the neighboring columns have not. The output of
the anti-bump circuit, a current, accumulates for a particular column and is compared to a
threshold. If the accumulated value exceeds the threshold, the circuit triggers the actuator
by turning on a pull-down transistor.
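
The detection logic can be mimicked in software to make the signal path concrete. The sketch below is a simplified digital analogue, not a model of the analog chip: it replaces the resistive network and the anti-bump circuit's soft comparator with a plain spanwise-neighbor difference and a hard dead zone, and the function name, parameter values, and test array are all hypothetical.

```python
import numpy as np

def detect_streak_columns(sensors, dead_zone, threshold):
    """Flag columns whose sensors differ consistently from their spanwise neighbors.

    sensors: array of shape (rows, columns); rows run streamwise, columns spanwise.
    """
    sensors = np.asarray(sensors, dtype=float)
    # Mean of left and right spanwise neighbors (edge columns reuse their own value).
    padded = np.pad(sensors, ((0, 0), (1, 1)), mode="edge")
    neighbors = 0.5 * (padded[:, :-2] + padded[:, 2:])
    diff = np.abs(sensors - neighbors)
    # Dead zone: differences smaller than dead_zone contribute nothing.
    response = np.maximum(diff - dead_zone, 0.0)
    accumulated = response.sum(axis=0)       # accumulate along each column
    return accumulated > threshold

# Hypothetical 3 x 5 sensor patch with one high-shear column (index 3).
sensors = np.ones((3, 5))
sensors[:, 3] = 2.0
flags = detect_streak_columns(sensors, dead_zone=0.6, threshold=1.0)
```

In this example only the column that stands out from both neighbors along its whole length is flagged; the adjacent columns, which differ from just one neighbor, stay inside the dead zone.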


Figure 33. Layout of circuitry on the integrated chip.

We designed this system to reduce the fully turbulent drag in our experimental
setup. We present the system with a fully turbulent airflow profile. Figure 34 graphs the
single column response of the system.


Figure  34.  Graph  of  the  output  waveforms  of  one  shear  stress  sensor  and 
the  corresponding  channel  from  the  detection/control  chip.  We  record  the 
data  in  a  fully  developed  turbulent  flow. 


Conclusions 

As  can  be  seen  by  the  work  presented  herein,  the  effort  to  reduce  drag  in  turbulent 
boundary  layers  requires  the  collaboration  of  engineers  in  many  different  fields.  The 
individual  contributions  within  each  field  (MEMS,  fluids,  controls,  and  electronics)  are 
novel  even  as  a  stand-alone  effort.  When  combined  together,  such  efforts  can  lead  to 
tremendously  exciting  discoveries.  Much  work  needs  to  be  done  in  the  future  to  obtain  the 
desired drag reduction, but efforts to date have proven to be extremely promising and
worthwhile.

REVIEW: MEMS and Its
Applications for Flow Control

Emerging  micromachining  technology  enables  us  to  fabricate  mechanical  parts  on 
the  order  of  micron  size.  It  provides  us  with  micro-sensors  and  micro-actuators 
which  facilitate  the  exploration  of  all  areas  of  science.  Furthermore,  these  miniature 
transducers  can  be  integrated  with  microelectronics.  With  an  integrated  system,  it 
then  becomes  possible  to  complete  the  loop  of  sensing,  information  processing,  and 
actuation.  This  type  of  system  enables  us  to  perform  real-time  control  of  time  varying 
events  which  are  common  in  fluid  dynamics.  In  this  review  paper,  we  will  first 
briefly  introduce  Micro-Electro-Mechanical-Systems  (MEMS)  technology.  Then,  the 
applications  of  MEMS  to  flow  control  will  be  discussed. 


Introduction 

During the last three decades, the field of solid-state electronics
kept improving the manufacturing technology involved in
the miniaturization of electronic elements. At present, it is possible
to make electronic feature sizes on a sub-micron scale.
The integrated circuit (IC) evolved from a small number of
transistors on a chip to several million transistors on a chip.
The range of applications increased dramatically with time such
that the IC has changed our lives. However, an IC cannot interact
with the environment. The interaction between a system and
the external world has to rely on sensors and actuators which
are usually mechanical devices. In the past, almost all of the
mechanical parts had sizes much larger than the electronic elements
because traditional manufacturing technology had difficulty
in fabricating small mechanical parts, say less than a mm.
Interestingly enough, Feynman did raise the question of making
miniature things in his inspiring paper entitled “There is plenty
of room at the bottom” (1959). He even proposed a prize to
the first person who could make a motor of 1/64 in. cube. Though
the prize was claimed a few years after 1959, it was accomplished
by delicate handicraft, not by a mass production technology.
In the 1980s, a new manufacturing process, the surface
micromachining technology (Howe and Muller, 1983), became
available. It could be used to manufacture micron sized mechanical
devices with sophisticated geometry. An example is the
micro motor (Fan, et al., 1988a), which is shown in Fig. 1. A
piece of human hair is also shown in the figure to indicate the
size of the motor. These achievements signified the beginning
of the era of micromachinery, which will be as valuable as the
emergence of microelectronics thirty years ago.

The  significance  of  micromachine  technology  is  that  it  makes 
it  possible  to  provide  us  with  mechanical  parts 

1.  of  micron  size, 

2.  batch  fabricated  in  large  quantities  and 

3.  integrable  with  electronics.

Length  scale  matching  between  transducers  and  investigated 
phenomena  is  essential  so  that  enough  spatial  resolution  can 
be  achieved  for  sensing,  and  efficient  momentum  and  energy 
transfers  from  the  actuator  to  the  controlled  subject  can  happen. 
Our scientific knowledge spans the length scales from light years to the sub-atomic
domain. Natural phenomena with length scales much larger than the size of human beings,
such as the weather, are understood mainly through sensing. No actuation of that scale is
possible. At the other extreme is the atomic scale, which we can explore both by sensing
and actuation. Accelerated particles are used as actuators, because these particles match
the size of the sub-atomic world. When we deal with natural phenomena of human size, there
are abundant transducers for both sensing and actuation. However, a gap exists in the
spectrum of our knowledge for scales on the order of microns. We do not know very much
about the nature of physics in this range due mainly to the lack of transducers of this
size. The miniature mechanical parts fabricated by micromachining technology are going to
fill this gap.

Micromachining  is  a  batch  production  process  which  employs 
lithographic  processes.  Since  the  micromachining  technology  is 
derived  from  the  IC  technology,  it  is  possible  to  monolithically 
integrate the mechanical and electronic elements. A micro-electro-mechanical system
(MEMS), with its integrated micron-size mechanical parts and IC, will be able to sense
the physical world, to process the information, and then to manipulate the physical
phenomena through actuators.

There  are  numerous  opportunities  for  applying  MEMS  to 
flow control. It is known, for example, that shear flows are sensitive to perturbations.
If these perturbations are exerted at the origin of the shear layer and are within the
instability band, the streamwise development of the flow can be significantly modified
(Ho and Huang, 1982; Ho and Huerre, 1984).
This  type  of  flow  control  scheme  has  been  demonstrated  to  be 
very  effective  on  free  shear  flow  because  it  takes  advantage  of 
the  flow  instability,  which  is  a  powerful  flow  amplifier.  This 
point  is  especially  important  for  micro  actuators  because  these 
actuators  cannot  deliver  large  forces  or  high  power. 

For  turbulent  wall-bounded  shear  flow,  we  face  different 
types  of  challenges.  The  flow  structures  responsible  for  viscous 
drag  increase  in  turbulent  flow  are  very  small  in  size,  typically 
several  hundred  microns  in  width  and  several  mm  in  length. 
Their  life  time  is  short,  in  the  millisecond  range  or  less.  Great 
difficulties  arise  from  the  fact  that  they  are  randomly  distributed 
in  time  and  space.  In  the  past,  direct  manipulation  of  these 
structures was very difficult. The matching in length scale between the micro transducers
and the structures makes this task possible.

In  this  review  paper,  several  examples  of  micromachines  will 
first  be  presented.  Then,  the  manufacturing  technology  will  be 
briefly  summarized.  Finally,  we  will  discuss  the  science  issues 
unique to the micron dimension and flow control using MEMS-based transducers.

Micromachined  Devices 

MEMS technology provides transducers to enable us to perform sensing and actuation in
various engineering applications. Figures 2 and 3 show two fundamental micro mechanical
elements, a beam and a pin joint, from which many micromachines can be derived. For
example, the micro motor (Fig. 1) is a combination of a pin joint (rotor) and several
short cantilever beams (stators).

Fig. 2 A micro beam (Tai and Muller, 1988)

Fig. 3 Micro pin-joint (Fan et al., 1988)

Micro transducers also contribute to the advancement of science by enabling us to study
the natural phenomena that occur on the order of microns in all branches of science:
fluid dynamics, solid mechanics, materials science, biology, etc. The push-pull
relationship between science and technology further enriches this field. Most biological
cell sizes are on the order of microns, which matches the scale of micromachines. The
micro gripper in Fig. 4 (Kim et al., 1992) shows that it is possible to hold a single
cell of 7 μm in diameter with a micro device. With some imagination, we can now see a new
world through the picture.

In the near future, it should also be possible to sense and to manipulate the function of
a cell through transducers (Najafi and Wise, 1986). It is interesting for us to
understand the functions of neurons, which facilitate our cognitive ability, our body’s
motion and sensing. Micromachines make it possible to capture a single neuron in a micro
cavity and to obtain the electrical signal between neurons (Fig. 5, Tatic-Lucic et al.,
1993). It is even possible to receive the neuron signal in vivo by inserting micro neuron
probes into the brain without much disturbing the neurons, due to the probe’s miniature
size (Fig. 6, Tatic-Lucic et al., 1994).

Fig. 5 Neurons in micro cavities (Tatic-Lucic et al., 1993)

Fig. 6 Neuron probe (Tatic-Lucic et al., 1994)

In the field of material science, it is known that micron-sized subjects are much
stronger than large subjects. The fracture strength can be an order of magnitude larger
than the bulk fracture strength. Furthermore, silicon material at room temperature
experiences only elastic deformation until fracture. Plastic deformation is not obvious.
These two features provide a new design concept for micro parts. The buckled state, in a
largely deformed subject, has seldom been used in the engineering design of macro
subjects because it poses the risk of experiencing plastic deformation and even the
possibility of fracture. Buckled micro parts are much less prone to these problems. As a
matter of fact, several designers have already created devices that utilize these
advantages (Ataka, et al., 1994; Matoba, et al., 1994).

In the fluid mechanics arena, turbulence research can benefit from the micro transducers.
The spread of the length scales is a function of Reynolds number. The fine dissipation
scale in very high Reynolds number flows requires a time response on the order of μs and
a spatial resolution on the order of μm. The traditional hot-wire has a spatial
resolution of about one mm, which is three orders of magnitude larger than the
requirement. Its time constant is about one order of magnitude away from the requirement.
Transducers made by micromachining technology make it possible to meet these
requirements. Using the surface micromachining technique, a micron-sized hot-wire
anemometer was made from doped polysilicon (Jiang et al., 1994a and 1994b). When operated
in constant temperature mode, a one megahertz bandwidth is possible. Due to the
flexibility of the geometric design by lithographic techniques, manufacturing a multiple
sensor array is not a painstaking process anymore (Fig. 7).
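
The dependence of the scale spread on Reynolds number can be illustrated with the classical Kolmogorov estimates, which this review does not state and which are assumed here: the dissipation length scale falls as Re^(-3/4) relative to the large scale, and the dissipation time scale as Re^(-1/2) relative to the large-eddy turnover time. The function name and the sample flow values are illustrative only.

```python
def kolmogorov_scales(length_m, velocity_ms, nu_m2s):
    """Order-of-magnitude Kolmogorov length and time scales.

    Assumes the classical estimates eta/L ~ Re**(-3/4) and
    tau/(L/U) ~ Re**(-1/2), with Re = U*L/nu. Prefactors of
    order one are ignored.
    """
    reynolds = velocity_ms * length_m / nu_m2s
    eta = length_m * reynolds ** -0.75
    tau = (length_m / velocity_ms) * reynolds ** -0.5
    return reynolds, eta, tau


if __name__ == "__main__":
    # Hypothetical air flow: 1 m large scale, 100 m/s, nu = 1.5e-5 m^2/s.
    reynolds, eta, tau = kolmogorov_scales(1.0, 100.0, 1.5e-5)
    print(f"Re = {reynolds:.2e}")
    print(f"eta = {eta * 1e6:.1f} um, tau = {tau * 1e6:.1f} us")
```

For these assumed values the estimates land in the micron and microsecond range, which is the regime the paragraph above describes.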


Fig. 7 Hot-wire rake (Jiang et al., 1994)

Fig. 8 Air bag sensor (Analog Devices)

Micro transducers are already available for industrial applications. The automobile
air-bag sensor (Fig. 8) developed by Analog Devices is used to sense the deceleration
during an automobile accident to activate the air-bag. Monolithically integrated
sensor-microelectronics (Fig. 8) are used for the detection of the capacitance variation
of the comb structure (Fig. 9) due to impact, signal conditioning, calibration and many
other functions. The integrability between sensor and electronics simplifies the
packaging process. The lithographic process can reproduce large numbers of integrated
micro-sensors and decision circuits on a wafer and therefore greatly reduce the unit
cost. The micro optical mirror (Fig. 10) developed by Texas Instruments illustrates a new
concept for light beam steering. In a fairly small area, 2.3 cm², about 400 thousand
individually addressable mirrors are used to reflect images onto a large screen for high
definition television projection. These mirrors have high optical efficiency and at
least a 15 year life span. This is an example of using a large number of distributed
actuators to accomplish a complicated light steering task.

The  above  mentioned  examples  are  a  very  small  fraction  of 
the  available  micromachines.  Intensive  research  efforts  in  the 
last  few  years  have  brought  forth  many  new  devices  as  well  as 
new processing techniques. Three basic manufacturing techniques are introduced in the
following sections.

Micromachining  Technology 

Micromachining is the use of a sequence of microfabrication processes to make
micromechanical components. As a result, the fundamentals of micromachining are very
different from conventional machining. In fact, because of the use of microfabrication
processes, micromachining has generally been regarded as being derived from
integrated-circuit (IC) technology. There is no doubt that micromachining and IC
technology do have many similarities. As an example, micromachining also relies heavily
on the use of lithographic methods to first form pre-designed resist patterns (or masks)
and then selectively etch the undesirable part away. Nevertheless, they still have many
differences. One important difference is that in micromachining, selective etching is
used purposely to create 3-D structures, while in IC fabrication, distinctively 3-D
structures must be avoided so as to facilitate the process of making smaller and smaller
feature sizes on a planar wafer surface. Interestingly, it is these 3-D or even
free-standing structures that make micromachining useful, as they have numerous
micromechanical applications.

Fig. 10 Micro optic mirror (Texas Instruments)

Since its beginning, micromachining has already developed into many branch technologies.
Among them, bulk micromachining, surface micromachining, and LIGA technology are the most
important ones. We will discuss them in the following section, but only briefly. For more
information, the readers are referred to some excellent review papers (Petersen, 1982;
Seidel, 1987; O’Connor, 1992; Bryzek et al., 1994).

Bulk Micromachining. A general definition of bulk micromachining is to microfabricate
microstructures from a starting substrate by removing the unwanted portion of it.
Accordingly, there are many possible combinations of choosing the starting substrate
materials and the means of material removal. The mainstream of bulk micromachining,
however, usually implies that the substrate is a single-crystalline silicon wafer and the
material removal is done by selective wet-chemical etching, either isotropically or
anisotropically. Moreover, the selective etching of silicon is often done through the use
of photolithographic masking in conjunction with etch-stop techniques such as using a
heavily-doped boron layer (Raley, 1984) or a silicon p-n junction (Jackson, 1981).
Altogether, many silicon microstructures can be made, including cantilevers, beams,
diaphragms, channels, nozzles, etc. These microstructures have formed the building blocks
of many MEMS devices.

Chronologically, the development of silicon chemical etchants has been documented since
the 1950’s as researchers tried to find a precise way of etching silicon (Robbins, 1959).
The developed aqueous silicon etchants were mainly isotropic and based on the mixture of
hydrofluoric acid, nitric acid, and acetic acid (HNA). Anisotropic etchants for silicon
started in the 1960’s. The most studied etchants were potassium hydroxide (KOH) (Times,
1967), ethylene-diamine-pyrocatechol with water (EDP or EPW) (Finne, 1967), and hydrazine
solutions (Lee, 1969). The anisotropic feature of these etchants is that they etch in the
⟨100⟩ and ⟨110⟩ silicon crystallographic directions much faster than in the ⟨111⟩
direction, which allows the design of microstructures to be naturally bounded by {111}
crystalline planes. Figure 11 shows the typical cases of both isotropic and anisotropic
bulk micromachining. Note that etching happens only where the surface is not covered by
the patterned mask. For the case of anisotropic etching, different surface orientations,
either (100) or (110), will give totally different geometries although the saturating
walls are all {111} planes.
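
The {111}-bounded geometry on a (100) wafer can be made concrete with a short calculation. The sketch below assumes the standard crystallographic angle between a {111} plane and the (100) surface, arccos(1/√3) ≈ 54.74°, and an ideal etch in which the {111} planes do not etch at all (no mask undercut). The function name and the sample dimensions are illustrative and do not come from the text.

```python
import math

# Angle between a {111} sidewall and the (100) wafer surface.
SIDEWALL_ANGLE = math.acos(1.0 / math.sqrt(3.0))  # about 54.74 degrees


def etched_cavity(mask_opening_um, etch_depth_um):
    """Return (bottom_width_um, depth_um) of an ideal anisotropic etch.

    Assumes a (100) wafer, a mask edge aligned with a <110> direction,
    and negligible etching of the {111} planes.
    """
    # Depth at which the sloped walls meet and the etch self-terminates.
    max_depth = 0.5 * mask_opening_um * math.tan(SIDEWALL_ANGLE)
    depth = min(etch_depth_um, max_depth)
    bottom = mask_opening_um - 2.0 * depth / math.tan(SIDEWALL_ANGLE)
    return max(bottom, 0.0), depth


if __name__ == "__main__":
    print(etched_cavity(100.0, 50.0))   # flat-bottomed trench
    print(etched_cavity(100.0, 500.0))  # self-terminated V-groove
```

The second call shows the self-limiting behavior: once the sloped walls meet, the cavity stops deepening no matter how long the etch runs.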

This feature was broadly used and these anisotropic etchants were further refined in the
1970s and 1980s (Kern, 1978; Bassous, 1978; Bean, 1978; Reisman, 1979; Kendall, 1985).
Recently, even metal-exclusive anisotropic etchants of ammonium hydroxide (Schnakenberg,
1991a) and tetramethyl ammonium hydroxide (TMAH) (Schnakenberg, 1991b; Tabata, 1991) have
been developed. As more and more MEMS devices are integrated with electronics, these
etchants have become more important. Today, it is generally agreed that bulk
micromachining using wet chemical etching is a mature technology and many MEMS devices
are fabricated using it.

In addition to wet chemical etching, people have recently also started to look at dry
etching processes that can replace or even outperform it. For example, laser etching
(Bloomstein, 1994), plasma etching (Miu, 1993), gas etching, reactive-ion etching
(Linder, 1991), ion beam etching and even micro electro-discharge-machining (EDM) all
have been demonstrated. Bulk micromachining using dry etching is not limited to special
crystalline orientations and there will be no surface-tension problems as often
encountered in wet etching. However, in order for dry bulk micromachining to become
practical, a lot of research still needs to be done.

Surface Micromachining. As the name suggests, surface micromachining is performed on the
surface of a substrate, which can be a piece of silicon, glass, alumina, or even metal.
This substrate, however, only functions as a mechanical support and it does not
participate in the processing. The micromachining then involves combinations of thin-film
deposition and patterning. In the end, a selective etch will take away certain layers
(called sacrificial layers) and leave other layers (called structural layers) standing
free. Figure 12 demonstrates a simplified process of surface micromachining.

According to literature, this concept of surface micromachining was already demonstrated
in the 1950’s [US Patent 1956]. However, the use of it to make a complete MEMS
resonant-gate transistor was done much later by Nathanson (1967). After this paper, a
series of surface-micromachined devices were published including the early work of the
digital mirror display (Preston, 1972). But it was not until 1982 that Howe (1982) used
LPCVD polycrystalline silicon to make micro cantilevers and bridges. The significance of
this polysilicon surface micromachining process is that it is compatible with IC
technology. The following work at Berkeley in the late 1980s (Fan, 1988; Tai, 1989; Tang,
1989; Kim, 1992) then demonstrated the broad use of polysilicon micromachining for
various devices such as micro beams (Fig. 2), pin-joints (Fig. 3), sliders, resonators,
tweezers (Fig. 4), and micromotors (Fig. 1). Today, polysilicon micromachining has
established itself as one of the most important branches of surface micromachining. In
addition to polysilicon surface micromachining, researchers have also explored many other
materials such as aluminum (Sampsell, 1993) and silicon nitride (Tabata, 1991). Surface
micromachining continues to flourish as MEMS devices become smaller, lighter, faster and
cheaper.

Fig. 11 Typical cases of silicon bulk micromachining: (a) anisotropic etching of (100) Si
wafer, (b) anisotropic etching of (110) wafer, (c) isotropic etching

Fig. 12 A schematic process to demonstrate the surface micromachining technology. Note
that the overhang structure in (d) can be part of a cantilever beam, a bridge, a plate,
or even a diaphragm.

LIGA Technology. Finally, there also is the LIGA technology (a German acronym for
Lithographie, Galvanoformung, Abformung) (Becker, 1986; Ehrfeld, 1988). Basically, the
LIGA process uses X-ray lithography to generate a deep pattern on a conductive substrate.
The empty space in the X-ray resist then is electroplated from the bottom of the
substrate to the top of the resist (Fig. 13), which generates a negative replica of the
resist mold. This allows structures with submicron resolution from the capability of
X-ray lithography. The microstructures can actually be one millimeter thick, and with a
submicron precision, there is no other micromachining technique that can match up with
it. However, the drawback is the cost for the X-ray source, which requires a high-energy
synchrotron. Because of this, LIGA may not be a popular technology but it is useful for
some special applications which require a fine spatial resolution and a high aspect
ratio. Otherwise, the rarity of the X-ray source and the cost of the technique made
others develop similar techniques but with ultraviolet (UV) light sources (Frazier,
1992), available in any IC manufacturing laboratory. Light-heartedly termed as
“poor-man’s LIGA” or simply “cheap LIGA,” this UV-based micro-electroplating gained
popularity in many areas and is used to fabricate motors, flaps, and channels (Frazier,
1991; Miu, 1993; Joo, 1995).

Fig. 13 Micro electro-plating


Design of Micromachines

A machine must satisfy several basic requirements: configuration, motion, function and
efficiency. Each machine part has its specific geometry and moves in a required manner.
The combination of configuration and motion makes the machine capable of carrying out its
function. The efficiency or performance of the machine depends on its design.

A micromachine also has to satisfy the same requirements. However, the design is much
more challenging because the governing physical mechanism might change in the micro
scale. The drastic size reduction in the micro world results in the breakdown of the
scaling laws used in traditional engineering fields because the change in length scale is
large enough that the equations governing the momentum and energy balances need to be
re-examined. In addition, the inter-link among these fields becomes important in micron
scales, such that the surface physics and chemistry may significantly affect the boundary
conditions in fluid and solid mechanics. The changes in the governing equations and
boundary conditions prevent us from using the traditional approaches which were well
developed in the past. Scaling from the macro to the micro world is questionable. Design
rules developed for the macro scale will not be valid for the micro scale. We need to
develop new intuition to design micromachinery.

Fig. 14 Micro cilia (Ataka et al., 1994)

Relative motion between parts is essential for most machines. However, two micromachine
parts designed to have a certain type of motion might not perform in the desired way due
to the large friction between two solid surfaces. A well-known example (Tai and Muller,
1989) is that the rotor of a micro motor will not rotate if the rotor is in full contact
with the substrate. If dimples are placed between the rotor and the substrate to reduce
the contact area, rotation becomes possible. This result leads to the conclusion that the
friction force depends on the contact area, which should not be a critical parameter in
conventional physics. It suggests that friction force in the micron scale has undergone a
fundamental change. In general, the frictional force is a sum of the surface force and
the contribution from the body force. In the macro scale, the body force is much larger
than the surface force. Therefore, the frictional force is a function of the body force
normal to the contact surface and not a function of the contact area. In the force
balance equation, the frictional force therefore has a simple expression. For surface
micromachined subjects, the dimension normal to the wafer surface is typically two orders
of magnitude smaller than the transverse length scale. This results in the volume per
unit contact area of the micro subjects being much smaller than that of the macro
subjects. Hence, surface force dominates the frictional force in the motion of micro
subjects. However, the scientific nature of the various surface forces which contribute
to the frictional force is largely unknown, so the analytical form of the frictional
force cannot be written down explicitly or implicitly. This fact leaves an unknown term
in the equation of motion of the micro rigid body. The role switch between the body force
and the surface force certainly poses a significant burden in designing a micromachine.
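
The role switch between body and surface forces follows from simple scaling, which can be sketched numerically. The example below is an illustration only: it models a flat silicon plate whose thickness is 1/100 of its lateral size (the two-orders-of-magnitude ratio quoted for surface micromachined parts), takes the plate's weight as the body force, and uses a hypothetical constant surface force per unit contact area, `surface_pressure`, because the true form of the surface forces is not known. All names and values are assumptions.

```python
SILICON_DENSITY = 2330.0  # kg/m^3, bulk silicon
GRAVITY = 9.81            # m/s^2


def body_to_surface_ratio(length_m, surface_pressure=1.0, aspect=100.0):
    """Ratio of weight to surface force for a square plate.

    length_m: lateral size of the plate.
    surface_pressure: hypothetical surface force per unit contact
        area (Pa); a placeholder, not a measured value.
    aspect: lateral size divided by thickness.
    """
    thickness = length_m / aspect
    contact_area = length_m ** 2
    weight = SILICON_DENSITY * GRAVITY * contact_area * thickness
    surface_force = surface_pressure * contact_area
    return weight / surface_force


if __name__ == "__main__":
    for length in (1.0, 1e-3, 1e-4):  # 1 m, 1 mm, 100 um
        ratio = body_to_surface_ratio(length)
        print(f"L = {length:g} m -> body/surface ratio = {ratio:.3g}")
```

Whatever value is chosen for `surface_pressure`, the ratio falls linearly with the part size, so shrinking a part by four orders of magnitude shifts the balance toward the surface force by the same factor.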

Due to the difference in governing mechanisms, we might also have to change the basic
design principle. Most of the geometries used in micromachines, e.g., gears and rotors,
are still size-reduced emulations from the macro world. It is not obvious that direct
emulation is the most appropriate design, especially for the ones involving relative
motion between large surfaces. Due to the large friction force, micromachines involving
sliding motion cannot have high efficiency. Therefore, a new design is needed when a
subject is to be transported from one place to another. Simulating the transport
mechanism of the cilia, Ataka et al. (1994) made an array of flexible silicon cilia (Fig.
14). By thermal actuation, the extension-contraction cycle can move a solid subject
sitting on the top of the array. This design can transport a subject without encountering
the large friction problem in the micro sliding surface.

Micro  Transducers  for  Flow  Control 

The small physical size associated with the miniature transducers improves the spatial
resolution. The temporal response can easily be increased to a large extent because of
the low inertia of the transducer. These transducers have obvious advantages for flow
control problems (McMichael, 1996). Jacobson and Reynolds (1993) used resonant beam
actuators to examine the effect on the surface shear stress produced by a stationary
vortex pair. Smith and Glezer (1995) have demonstrated the concept of applying a micro
jet to achieve thrust vectoring. On the other hand, the capability of providing large
quantities of sensors and actuators allows us to perform distributed control. This
potential opens a new horizon in many engineering applications and remains to be
explored. Its impact could be even more profound than the size reduction of transducers.

Micro Hot-Wire Anemometer. Since the spatial resolution of the anemometers for flow
measurement is determined by their physical dimensions, it is advantageous if the wire
size can be reduced. In addition, miniature hot-wires also reduce power consumption and
thermal interference to the flow. In the past, many anemometer designs have been
demonstrated using either surface or bulk micromachining. However, they are all in the
form of a wire adjacent to the surface of a chip. None of them consists of a wire
free-standing in space without any nearby structure so that undisturbed free space
velocity can be measured. We have developed a micromachined anemometer that has the
configuration of a conventional hot-wire anemometer but with a greatly reduced wire size
(Jiang et al., 1994a and 1994b). Not only are the spatial resolution and device
sensitivity better, the frequency response is also improved by at least one order of
magnitude over the conventional hot-wire anemometers.

The micromachined hot-wire anemometer is made of a polysilicon wire. The typical sensor
wires are about 0.5 μm thick, 1 μm wide, and 10-160 μm long. The dynamic performance and
sensitivity of the micromachined hot-wire were tested. A heating time of 2 μs and a
cooling time of 8 μs for the 30 μm long probe in constant current mode have been
obtained. For constant temperature operation, a time constant of 0.5 μs for the 10 μm
long probe can be reached. The corresponding cut-off frequency is 1.4 MHz. A calibration
curve of the 20 μm long micromachined hot-wire is shown in Fig. 15. The average
sensitivity is 20 mV/(m/s) at a heating current of 0.5 mA.

Fig. 15 Calibration of a 20 μm long micromachined hot-wire sensor at two different angles

Micro Surface Shear Stress Sensor. Conventional hot-film type surface shear stress
sensors have been used for several decades but the heat transfer to the substrate has
always been a handicap in their operation. For measurements in air, the sensitivity is
close to zero because of the low heat capacity of air. By using surface micromachining
technology, a vacuum chamber is placed under the heating element, as shown schematically
in Fig. 16. This cavity significantly decreases the heat loss to the substrate and
increases the shear stress sensitivity (Liu et al., 1994; Huang et al., 1995 and 1996).

Fig. 16 Schematic of the (a) top and (b) cross-sectional views of the micro shear-stress
sensor; (c) SEM picture of the micro shear-stress sensor

A  silicon  nitride  film  is  used  as  the  diaphragm  material  which 
supports  the  heating  element.  The  chamber  is  formed  by  etching 
away  a  sacrificial  layer  underneath  the  diaphragm.  The  heating 
element is made of a uniformly doped polysilicon resistor. The aluminum metallization
forms the metal leads.

The typical sensitivity of the sensor with a cavity underneath
is 15 mV/Pa, which is about one order of magnitude higher
than that without a cavity. Figure 17 is a calibration curve of
the shear stress sensor, plotted as the square of the output
voltage versus the 1/3 power of the shear stress. A linear
relationship is obtained. The typical cut-off frequency of the
sensor is 10 kHz when operating in the constant temperature
mode. Another approach is to measure the shear stress directly
by sensing the force on a micro floating element flush mounted
on the wall (Shajii et al., 1992; Pan et al., 1995).
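
The calibration described above can be sketched as a linear least-squares
fit, assuming the usual hot-film relation E^2 = A + B * tau^(1/3) between
output voltage E and wall shear stress tau. The function names and the
synthetic calibration points below are illustrative assumptions, not data
from this work.

```python
import numpy as np

def fit_hot_film_calibration(tau_pa, volts):
    """Fit E^2 = A + B * tau^(1/3) by linear least squares.

    tau_pa : wall shear stress samples in Pa
    volts  : corresponding sensor output voltages in V
    Returns the tuple (A, B).
    """
    x = np.asarray(tau_pa, dtype=float) ** (1.0 / 3.0)
    y = np.asarray(volts, dtype=float) ** 2
    B, A = np.polyfit(x, y, 1)  # polyfit returns slope first, then intercept
    return A, B

def voltage_to_shear(volts, A, B):
    """Invert the calibration: tau = ((E^2 - A) / B)^3."""
    e2 = np.asarray(volts, dtype=float) ** 2
    return ((e2 - A) / B) ** 3

# Synthetic calibration points (placeholders only, not measured values).
tau_cal = np.array([0.1, 0.3, 0.6, 1.0, 1.5])
A_true, B_true = 4.0, 1.2
e_cal = np.sqrt(A_true + B_true * tau_cal ** (1.0 / 3.0))

A_fit, B_fit = fit_hot_film_calibration(tau_cal, e_cal)
tau_back = voltage_to_shear(e_cal, A_fit, B_fit)
```

Plotting E^2 against tau^(1/3) turns the calibration into a straight line,
so the two constants can be read off from a simple linear fit.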


Fig. 17 Micro surface shear stress sensor output versus shear stress
at constant temperature mode

Micro Pressure Sensor. The pressure sensor can be designed and
fabricated with a technology similar to that of the aforementioned
shear stress sensor. A deposited polysilicon film is patterned as
the piezoresistor for strain measurement (Liu et al., 1993). On a
250 × 250 μm² silicon nitride diaphragm, four polysilicon resistors
are placed around the center and another four on the edge of the
diaphragm. The total thickness of the diaphragm is 1.5 μm. The
polysilicon resistors, which are 500 nm thick, are buried inside
the diaphragm. They are connected to form a Wheatstone bridge for
temperature compensation, although the heavily doped polysilicon
shows a very small temperature coefficient. The output of the
Wheatstone bridge is proportional to the pressure exerted on the
diaphragm. Figure 18 shows a micromachined pressure sensor.

Fig. 18 A micromachined pressure sensor

The pressure sensors are fabricated by surface micromachining,
which is compatible with other MEMS devices. We have already
integrated this kind of pressure sensor into a microflow
measurement system (Pong et al., 1994; Shih et al., 1995).
The sensitivity of this sensor is about 245 μV/V·psi.
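
As a worked illustration of the quoted sensitivity, the sketch below
converts between applied pressure and bridge output, reading the figure as
about 245 μV of output per volt of bridge excitation per psi. The 5 V
excitation and the function names are assumptions made only for this
example.

```python
PA_PER_PSI = 6894.757  # 1 psi expressed in Pa

def bridge_output_volts(pressure_pa, excitation_v,
                        sensitivity_uv_per_v_psi=245.0):
    """Bridge output for a given pressure, assuming a linear response."""
    pressure_psi = pressure_pa / PA_PER_PSI
    return sensitivity_uv_per_v_psi * 1e-6 * excitation_v * pressure_psi

def pressure_from_output(output_v, excitation_v,
                         sensitivity_uv_per_v_psi=245.0):
    """Inverse of bridge_output_volts; returns pressure in Pa."""
    pressure_psi = output_v / (sensitivity_uv_per_v_psi * 1e-6 * excitation_v)
    return pressure_psi * PA_PER_PSI

# Hypothetical case: 1 psi applied with a 5 V bridge excitation.
v_out = bridge_output_volts(PA_PER_PSI, excitation_v=5.0)
p_back = pressure_from_output(v_out, excitation_v=5.0)
```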

Micro Actuator. In flow control problems, we need actuators that
can provide about one mm of off-plane motion and a force on the
order of one milli-Newton per mm² to sustain the wind loading.
These requirements are several orders of magnitude higher than
what is available from existing micro actuators. To meet these
stringent requirements, we first selected an actuator with a
simple configuration so that we would at least not be overburdened
by fabrication difficulties. A flap type actuator was chosen for
the design. It is obviously fragile, is sensitive to the wind load
direction, and has many other deficiencies. However, it does serve
the purpose of concept demonstration in laboratory tests (Liu et
al., 1994; Tsao et al., 1994; Liu et al., 1995a and 1995b). In
addition, these flap actuators have many other applications, such
as large angle light steering and heat transfer enhancement.

The integration of permalloy with surface-micromachined
polysilicon flaps has been completed (Fig. 19). Such a flap
uses polysilicon as the structural material for both its beam
supports and plate. Permalloy (5 μm thick) is electrodeposited on
top of the plate to provide the actuation mechanism.

Flaps of two sizes, 1 × 1 mm² and 2 × 2 mm², have been made,
and bending angles up to 80 deg have been achieved with a
magnetic flux density of about 800 Gauss. Such a large bending
angle is equivalent to a point force of 0.2 mN acting on the edge
of the flap. These flaps have been mounted on the leading edge of
a delta wing and have proven useful for aerodynamic control of
the wing.

In addition to the permalloy flaps, we have also developed a
bulk-micromachined flap with both copper coils and permalloy
(Fig. 20), which gives more alternatives for controlling the
device. The active flap is fabricated by integrating bulk
micromachining techniques with magnetic thin film processes. The
mechanical structure consists of a 40 μm thick, 4 mm by 4 mm
single crystal silicon plate, attached to the bulk silicon through
two 50 μm wide S-shaped springs. The electromagnetic component
of the actuator sits on the plate surface and consists of a
30-turn planar copper coil, on top of which is a 10 μm triangular
permalloy (Ni80Fe20) layer. Controllable deflection of the flap
up to ±60° has been achieved by both external fields and coil
current.

Fig. 19 A magnetically actuated surface-micromachined permalloy flap
in a B-field of 400 Gauss

Fig. 20 An active flap integrated with both coil and permalloy. Both up
and down positioning are achieved.

Control  of  Macro  Aerodynamic  Device  by  Micro 
Transducers 

In  flow  control  problems,  it  is  desirable  to  be  able  to  globally 
alter  the  flow  by  applying  minute  amounts  of  energy.  We  are 
interested  in  exploring  the  possibility  of  manipulating  a  macro 
aerodynamic  device  by  using  microactuators.  The  key  point  is 
finding  a  coupling  between  the  macro  physical  world  and  the 
micro actuation. For a delta wing at high angle of attack, a large
fraction of the lift is contributed by the vortices originating
from the boundary layers around the two leading edges. The
coupling here is in the thin boundary layer before separation. If
the off-plane displacement of the micro actuator matches the
boundary layer thickness before flow separation, we will be able
to either change the separation location or vary the vorticity of
the delta wing's leading edge vortices. Once the symmetry of the
two vortices is broken, it is possible to obtain a rolling torque
or even other components of torque and force.

A delta wing model with a sweep angle of 56.5 deg was
employed for the test (Fig. 21). The model had rounded leading
edges. Micromachined actuators and micro shear stress sensors
were placed on a cylindrical rod located at the leading edge. The
rod could be rotated so that the positions of these transducers
could be changed to cover the leading edge area. Microactuators
with a flap size on the order of mm were distributed from the
apex to the trailing edge. An open-type wind tunnel was used for
the aerodynamic tests. The test section was 100 cm × 100 cm and
730 cm long. The speed of this wind tunnel ranged from 10 m/s to
45 m/s. A six-component force/moment transducer was used to
measure aerodynamic forces and moments. However, only the rolling
moment has been investigated so far.

Fig. 22 Effect of micro actuators (both sides) on rolling moment of delta
wing at AOA of 25°

After a detailed survey of the surface shear stress, we can
correlate the effective actuation region with the flow separation
region around the leading edge. The most exciting finding is
that the rolling torque obtained by actuating the micro flaps
can be very significant compared with the torque contributed by
the leading edge vortices. We use the pressure (suction) produced
by one of the leading edge vortices multiplied by the distance
between the pressure center and the center chord as the reference
torque, T_r. Based on our recent experimental data, the torque
produced by the microactuators can be on the order of 50 percent
of T_r (Fig. 22).

Large  Scale  Integrated  System  for  Distributed  Control 

For a phenomenon with a wide spread of length scales, e.g.,
turbulence, a large scale integrated transducer system can
facilitate communication of the local information measured by
individual sensors as well as identify the global characteristics
through a collection of the sensors. The actuators can then,
based on the local and global information, execute the proper
control strategy.

This concept is best illustrated by research in turbulence
control. Surface shear stress reduction in the turbulent boundary
layer is a grand challenge in fluid mechanics research. Many
randomly distributed small flow structures in the boundary layer
are responsible for drag production. The width of these structures
at high Reynolds number is only about several hundred microns,
much smaller than conventional sensors and actuators.
Furthermore, a large number of sensors spread over the surface
are necessary to detect these random features so that the
actuators can interactively manipulate these vortices.
Micromachine based transducers are small in size as well as
capable of forming large scale integrated systems. Direct control
of individual structures, to understand the response to actuation
and its underlying physics, now becomes possible. However,
matching the length scales between the micro transducers and the
flow features is not by itself enough to accomplish the task,
because detection and control logic need to be implemented.
Furthermore, the amount of data associated with a large number of
transducers is tremendous. The bandwidth needed for transmission
between the transducer assembly and a computer is currently beyond
the capability of existing data transmission instrumentation. A
more stringent requirement is that the detection-decision-actuation
process needs to be accomplished within the lifetime of the wall
structures. That means real-time operation is a necessity. In
other words, on-board signal processing is required. A
micro-electro-mechanical system with large scale integrated
microsensors, microelectronics and microactuators offers an
opportunity to solve these types of engineering problems which
require distributed real-time control.


The Surface Shear Stress Reduction Experiment. The
surface shear stress reduction experiment was carried out in a
two-dimensional channel flow facility. The channel has a cross
section of 2.54 cm × 61 cm and is 488 cm long. The mean velocity
can be varied from 5 m/s to 30 m/s. The background turbulence
level is about 0.1 percent.

The detection of the high shear stress streaks was performed
in the fully developed turbulent portion of the channel. An
understanding of the interaction between the actuator and the
wall structure is a crucial part of controlling the shear stress.
Since the integrated sensing-decision-actuation system is still
under development, a stationary streamwise vortex embedded in the
laminar part of the channel flow was used as a model to study
the actuator-vortex interaction (Tung et al., 1995).

Detection of High Shear Stress Streaks. A surface shear
stress imaging chip was developed for mapping the instantaneous
shear stress (Jiang et al., 1996). The imaging chip contains
eighty-five micro sensors (Fig. 23). Each of the three long rows
contains twenty-five sensors and each of the two short rows has
five sensors. With the imaging chip placed in the turbulent flow,
the voltage outputs from the twenty-five sensors in one row were
sampled and converted into shear stress using the calibration
curve. The signals are displayed as 2-D shear stress contours in
Fig. 24. The time axis is converted to a pseudo streamwise
coordinate by multiplying by the convection speed. This surface
stress map shows the long streaky pattern of the high stress
regions (light regions). When the Reynolds number is reduced, the
mean width of the streaks increases (Fig. 24(b)). These data
demonstrate the capability of distributed sensors in capturing
random events.
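
The conversion of the time axis to a pseudo streamwise coordinate amounts
to x = U_c * t, with U_c the convection speed. A minimal sketch follows;
the sample rate and convection speed are hypothetical values chosen only
for illustration, and the function name is not from the original work.

```python
import numpy as np

def pseudo_streamwise_coordinate(n_samples, sample_rate_hz,
                                 convection_speed_m_s):
    """Map sample indices to a pseudo streamwise distance x = U_c * t (m)."""
    t = np.arange(n_samples) / sample_rate_hz  # sample times in seconds
    return convection_speed_m_s * t

# Hypothetical acquisition settings (assumptions, not measured values).
x = pseudo_streamwise_coordinate(n_samples=4, sample_rate_hz=10_000.0,
                                 convection_speed_m_s=5.0)
```

Each row of sampled sensor outputs can then be placed at its x value to
build a two-dimensional contour map from a single spanwise row of sensors.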

Fig. 24 Instantaneous surface shear stress measured by the imaging
chip. (a) Re = 17,517; (b) Re = 8,758. Re is based on the half width and
centerline velocity of the turbulent channel flow.

Fig. 25 Block diagram of the control circuits

Fig. 26 Contours of (a) the instantaneous surface shear stress and (b)
the output of the control circuits

In order to activate the actuator only in the high stress region,
we first need to determine the transverse dimension of the high
stress region. The actuator located downstream can then be moved.
A single sensor, which is much smaller than the flow feature,
cannot determine by itself whether it is experiencing high stress
or not. A detection logic circuit (Fig. 25) was designed for this
purpose (Gupta et al., 1996). Basically, the signal output of a
sensor is compared with those of its neighboring sensors through
a filter-threshold combination. Whenever a sudden change is found,
the edge of the high shear stress region is determined (Fig. 26).
This chip can provide real-time decisions and also delivers
current to energize the electromagnetically driven actuator.
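
The neighbor comparison through a filter-threshold combination can be
imitated in software. The sketch below is only an analogy to the idea, not
the circuit itself: it assumes a first-order low-pass filter on each sensor
and a fixed threshold on the difference between adjacent sensors, and all
names and parameter values are hypothetical.

```python
import numpy as np

def detect_high_shear_edges(samples, alpha, threshold):
    """Flag edges of high shear stress regions between adjacent sensors.

    samples   : array of shape (n_time, n_sensors) of shear stress values
    alpha     : smoothing factor in (0, 1] of the first-order low-pass filter
    threshold : minimum filtered difference between neighbors to mark an edge
    Returns a boolean array of shape (n_time, n_sensors - 1).
    """
    samples = np.asarray(samples, dtype=float)
    filtered = np.empty_like(samples)
    filtered[0] = samples[0]
    # Exponential moving average in time, applied to every sensor at once.
    for k in range(1, len(samples)):
        filtered[k] = alpha * samples[k] + (1.0 - alpha) * filtered[k - 1]
    # Compare each sensor with its spanwise neighbor and apply the threshold.
    neighbor_diff = np.abs(np.diff(filtered, axis=1))
    return neighbor_diff > threshold

# Synthetic row of five sensors with a steady streak over sensors 2 and 3.
row = np.tile([1.0, 1.0, 3.0, 3.0, 1.0], (4, 1))
edges = detect_high_shear_edges(row, alpha=0.5, threshold=1.0)
```

The two flagged positions bracket the streak, which gives its transverse
extent without any single sensor needing an absolute reference level.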

Interaction between Micromachined Actuator and
Streamwise Vortices. A crucial part of shear stress control
is to determine an efficient control scheme, which needs to be
developed with an in-depth understanding of the interaction
between the actuator and the flow structures. As a first step, an
actuator interacting with a stationary vortex embedded in a
laminar flow is used to explore this phenomenon. A 1.3 mm thick
vortex generator with a curved surface (Fig. 27) is placed on the
surface of the channel. At the end of the vortex generator, a
longitudinal vortex pair is produced. Between the two vortices,
high speed fluid is brought toward the wall, generating a high
shear stress streak (Fig. 28(a)). The micromachined actuator
is a silicon nitride flap with 30 turns of copper coil and matches
the size of the longitudinal vortex pair. A permanent magnet is
located under the actuator. When an a.c. current passes through
the coils at a constant frequency, the flap oscillates at that
frequency (Miller et al., 1996). The surface shear stress is
measured and phase averaged (Fig. 28(b)). The surface shear stress
map shows an appreciable reduction.

Fig. 28 Contours of ensemble-averaged dU/dy. (a) Vortex generator
only; (b) vortex generator and an oscillating micro actuator.

We integrate the shear stress along the spanwise direction and
plot the integrated stress as a function of the phase angle of the
actuator motion (Fig. 29). Throughout the whole cycle, the surface
shear stress is lower than that of the stationary vortex alone.
The lowest stress occurs around the time when the actuator is at
its uppermost position. At 20 Hz oscillation, the stress under
controlled conditions can be lower than that of a Blasius velocity
profile. This suggests that the flap type actuator can effectively
transport the low speed fluid into the high speed stream. At
frequencies higher than 20 Hz, the efficiency decreases. However,
frequency alone may not be the proper operating parameter to
characterize the actuation. We believe that the frequency
multiplied by the oscillation amplitude of the flap is a more
appropriate parameter because it is related to the perpendicular
speed needed to transport the high speed fluid away from the
surface in the high shear stress region. More experiments are
underway to clarify the actuator-vortex interaction phenomenon.
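
The proposed parameter, frequency multiplied by oscillation amplitude, has
the dimensions of a velocity. The short sketch below shows how two
operating points with different frequencies can share the same value; the
numbers are hypothetical and are not operating points reported here.

```python
def flap_velocity_scale(frequency_hz, amplitude_m):
    """Characteristic wall-normal speed of the flap, f * a, in m/s."""
    return frequency_hz * amplitude_m

# Hypothetical operating points with the same f * a product.
slow_large = flap_velocity_scale(20.0, 2.0e-3)  # 20 Hz, 2 mm amplitude
fast_small = flap_velocity_scale(40.0, 1.0e-3)  # 40 Hz, 1 mm amplitude
```

If this scaling holds, operating points with equal products would be
expected to give comparable stress reduction, which is what further
experiments would need to confirm.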

Concluding  Remarks 

The emerging Micro-Electro-Mechanical-Systems (MEMS)
technology deals with phenomena at a very distinct length scale,
the micron size. In this range, the surface to volume ratio is
much larger than that at the macro scale. Hence, surface effects,
e.g., force and chemistry, become a dominating factor. On the
other hand, MEMS technology provides us with sensors and actuators
of micron size. These micro transducers enable us to explore and
control natural phenomena in a length scale regime in which we
have not had much experience. The integrated
micro-electro-mechanical system can execute in real time a
sensing-decision-actuation process which enables us to control
randomly distributed events in intricate turbulence control
problems. MEMS is a field which is rich in scientific challenges
as well as an enabling technology for broad engineering
applications.

Fig. 29 Variation of the drag coefficient with actuator phase angles for
different actuator oscillating frequencies

ABSTRACT

A technology, Micro-Electro-Mechanical-
Systems (MEMS), emerged in the late 1980s which
enables us to fabricate mechanical parts on the order of
microns. Micromachining technology is suitable for
developing new transducers or improving existing
transducer designs. Due to the dramatic reduction in
size, micro transducers can outperform traditional ones
by orders of magnitude. Furthermore, MEMS is a
fundamental technology which has the potential to
influence advancements in many fields. In the
automobile, electronics, bio-medical and television
industries, MEMS products have already made
appreciable impacts. In this paper, the applications of
MEMS for aerodynamic control will be presented.

INTRODUCTION 

A new technology with broad applications
may spark a revolutionary change in industry. Three
decades ago, microelectronics technology made such
an advance: the computer became an indispensable
part of our lives. On the other hand, the computer is a
collection of solid state transistors which perform fast
calculations and carry out preset logic decisions, but it
does not interact with the environment. For sensing
and manipulating the environment, sensors and
actuators are needed, most of which are mechanical.
MEMS technology provides sensors and actuators that
are micron sized. Furthermore, the MEMS fabrication
process is a derivative of microelectronics fabrication
technology. It is possible, though not necessarily
straightforward, to integrate micro-transducers with
the microcomputer such that an integrated system
capable of sensing, decision, and actuation becomes
feasible. The potential of this micro-electro-
mechanical system is immeasurable and could parallel
the progress of microelectronics technology.

Micromachining is a batch production process
which utilizes lithographic processes. Three
fabrication techniques, bulk micromachining, surface
micromachining, and the LIGA process, are commonly
used in producing mechanical parts on the micron
scale. A brief introduction to these fabrication processes
can be found in a review paper. A micro beam 6
microns in width and 150 microns in length made by
surface micromachining is shown in Fig. 1. The
beam is a basic configuration and is used in various
mechanical designs. When surface micromachining is
combined with bulk micromachining, we can make the
beam structure into a hot-wire probe for turbulence
research. The micro hot-wire probe can be moved in
free space similar to the conventional one and has a
frequency response reaching 1 MHz. A series of beams
can be made into a micro-comb structure (Fig. 2),
which is the heart of the micro accelerometer used in
the airbag sensor produced by Analog Devices (Fig. 3).
This microsensor has replaced the traditional
automobile collision sensor in all American made cars
since 1995.


Fig. 1. A micro beam (Tai and Muller 1988).

Fig. 2. Comb-drive resonator (Tang, Nguyen, and
Howe 1990).


Fig.  3.  Air  bag  sensor  (Analog  Devices). 

The lithographic process can be used to make
a large number of transducers on a single wafer. This
capability enables us to use distributed transducers to
detect physical phenomena. Texas Instruments
developed an array of micro mirrors for image
projection. In a 2.3 cm² area, about 400 thousand
individually addressable mirrors are used to direct light
in a manner so that high intensity and high contrast
images can be produced for slide or television
projections (Fig. 4).


Fig. 4. Micro optic mirror (Texas Instruments).


Distributed control has many applications in
aerodynamics. We will discuss two case studies: one
is surface shear stress reduction and the other is the
maneuvering of aircraft by micro actuators.
Reducing surface shear stress in a turbulent boundary
layer has remained one of the most challenging tasks
in fluid mechanics research. The roots of the problem
are:

1.  the  transverse  length  scale  of  the  near  wall 
structures  is  on  the  order  of  hundreds  of  microns 

2.  the lifetime of the structure is about one
millisecond or less

3.  the near wall structures are spread over the
surface area covered by the boundary layer.

Traditional transducers cannot satisfy the
requirements for detecting and interactively controlling
the individual near-wall flow structures. MEMS based
transducers are small enough to sense the surface high-
shear region and can be used as the basis of the
decision made by the onboard circuit to manipulate
flow structures. It is possible to build such a
Microsensors-Microelectronics-Microactuators system
(an M³ system) on a wafer for distributed real-time
control. This approach enables us to obtain a basic
understanding of how a near-wall structure responds to
artificial perturbations as well as to develop the process
for fabricating an M³ system. The active control of
near-wall structures may lead to the reduction of
viscous shear stress. Whether the micromachined
transducers will be used on aircraft for turbulent shear
stress reduction remains to be seen. The fundamental
knowledge obtained from building an M³ system for
controlling an intricate natural phenomenon has broad
applications for numerous engineering and scientific
challenges.

As a matter of fact, the concept of using micro
transducers for fluidic control has led us to discover a
way of controlling a macro aerodynamic device, an
aircraft, by using micro actuators. On a delta wing, a
large portion of the lift is contributed by the leading-
edge separation vortices, which originate from a thin
boundary layer. The thin boundary layer at the leading
edge provides a region for coupling between the micro
actuator and the global aerodynamic flow field. By
using the micro actuators to alter the separation region,
the symmetry of the vortex pair is destroyed and a
rolling torque is produced. In wind tunnel tests of a
delta wing model, the measured torque is appreciable.
For this case, we performed a flight test of a 1/7 model
of a Mirage III fighter. The effect is phenomenal.

In this paper, we will first discuss the
turbulent surface shear stress reduction experiment, and
then the delta wing control results will be presented.

TURBULENT  SHEAR-STRESS  CONTROL 

The presence of near-wall longitudinal streaks
in turbulent boundary layers contributes a significant
portion of the total skin friction. The streaks are on the
scale of several hundred microns at the Reynolds
number range tested in the wind tunnel. The
experiment is carried out in a two-dimensional
channel-flow wind tunnel. The channel, constructed of
Plexiglas plates, is 24" × 1" in cross-section and 192"
long. An axial blower supplies the air flow in the
channel.

Single  Point  Surface  Shear  Stress  Measurement 

A micro shear-stress sensor has been
specifically developed to measure the local surface
shear stress in a turbulent channel-flow wind tunnel.
The micro sensor measures the shear stress based on
the heat transfer from a micro heating element. A 1
micron thick vacuum chamber beneath a silicon nitride
diaphragm supporting the heating element
significantly reduces heat loss to the substrate. The
sensor is flush-mounted on the wall at 168" from the
inlet of the wind tunnel, where hot-wire and static
pressure measurements have confirmed the presence of
a fully-developed turbulent channel flow. The DC
output of the micro sensor is calibrated against the time-
averaged surface shear stress calculated from the
streamwise pressure gradient. Time traces of the micro
sensor output are recorded for five different Reynolds
numbers ranging from 7,000 to 20,000. The Reynolds
number is calculated based on the centerline velocity
and the half width (1.27 cm) of the channel. The time
traces are converted into surface shear stress based on
the calibration. The statistics of the fluctuating
turbulent surface shear stress are computed.
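
The two reference quantities used above follow from standard relations for
fully developed channel flow: the mean wall shear stress from a streamwise
momentum balance, tau_w = -h * dp/dx, and the Reynolds number
Re = U_c * h / nu, where h is the channel half width. The sketch below
assumes a kinematic viscosity of air of 1.5e-5 m^2/s and uses hypothetical
inputs; neither is taken from the measurements reported here.

```python
HALF_WIDTH_M = 0.0127  # channel half width (1.27 cm), from the text

def wall_shear_from_pressure_gradient(dp_dx_pa_per_m,
                                      half_width_m=HALF_WIDTH_M):
    """Mean wall shear stress (Pa) in fully developed channel flow."""
    return -half_width_m * dp_dx_pa_per_m

def channel_reynolds_number(centerline_velocity_m_s,
                            half_width_m=HALF_WIDTH_M,
                            kinematic_viscosity_m2_s=1.5e-5):
    """Reynolds number based on centerline velocity and half width."""
    return centerline_velocity_m_s * half_width_m / kinematic_viscosity_m2_s

# Hypothetical operating point (assumed values for illustration only).
tau_w = wall_shear_from_pressure_gradient(-20.0)  # dp/dx = -20 Pa/m
re = channel_reynolds_number(10.0)                # U_c = 10 m/s
```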

Figure 5 shows the comparison of the RMS,
the skewness factor, and the flatness factor of the
surface shear stress as measured by the micro shear-
stress sensor and in other studies.


Fig.  5.  Turbulence  statistics  measured  by  a  single 
micro  shear-stress  sensor. 

As can be seen, the turbulence statistics
measured by the micro sensor compare very well with
both previous experimental and numerical results.

Distributed  Measurement 

After the performance of a single micro shear-
stress sensor was demonstrated, a surface shear-stress
imaging chip (Fig. 6) was developed to measure the
instantaneous turbulent shear-stress distribution. Each
imaging chip consists of 85 micro sensors. On the
chip, the sensors are arranged into multiple arrays, and
the spacing between consecutive sensors is designed to
provide adequate spatial resolution in measuring the
near-wall streamwise streaks. The imaging chip is
packaged into a specially designed PC board which
allows the micro sensors to be flush mounted to the
wall of the turbulent channel-flow wind tunnel where
the experiment is carried out. One array of 25 micro
sensors covering a spanwise distance of 7.5 mm is used
in the measurement. Each sensor in the array is driven
by an external constant temperature circuit, and the
output is calibrated in the same way as in the single
sensor test. Figure 6 shows the instantaneous surface
shear-stress contours measured by the micro sensor
array. The horizontal axis of the contour plots is the
spanwise distance covered by the sensor array and the
vertical axis is a pseudo-streamwise distance computed
from the sampling time of the sensor signals and the
convective velocity of the flow. The shear-stress
contours show randomly distributed high shear-stress
streaks. An estimate of the length, the width, and the
spacing of the streaks indicates that the scales of the
streaks measured by the micro sensors agree very well
with previous flow visualization observations.


Fig. 6. Instantaneous surface shear stress measured by
the imaging chip. Re = 17,517.


Edge Detector for Detecting High Shear-Stress Regions

The high shear-stress streaks are randomly
distributed. A real-time detection scheme is needed to
properly identify the streaks among the background
fluctuations so that the downstream actuator can be
activated. A neural network based detection circuit has
been developed for this purpose. In this circuit, the
signal output of a sensor is compared with those of its
neighboring sensors through a filter-threshold
combination. Whenever a sudden change is detected,
the boundary of the high shear stress region is marked.
This circuit has been used in conjunction with the
imaging chip and a typical result is shown in Fig. 7.
The light region in the control output (Fig. 7-b)
represents ‘positive’ identification of high shear-stress
streaks. The good match between the light-color
region and the high shear-stress area (light gray
region) measured by the imaging chip (Fig. 7-a)
indicates the effectiveness of the detection circuit.


Fig. 7. Contours of (a) the instantaneous surface shear
stress and (b) the output of the control circuits.

Interaction between Micromachined Actuator and
Streamwise Vortices

To properly control the high shear-stress
streaks with micro actuators, an in-depth
understanding of the interaction between the actuators
and the flow structures is required. Experiments have
been carried out to investigate the interaction between
a single high shear-stress streak and a micromachined
actuator. A 1.3 mm thick vortex generator is used to
generate a longitudinal vortex pair in a laminar
boundary layer. The micromachined actuator used is a
silicon flap with 30 turns of copper coil and matches
the physical size of the longitudinal vortex pair.
Oscillation of the flap is achieved by the combination
of an external permanent magnet and an AC coil
current. The maximum deflection of the flap is about
30 degrees, which corresponds to a height of 2 mm from
the tip of the actuator flap to the wall. The maximum
oscillation frequency is about 100 Hz. The actuator is
placed downstream from the vortex generator where the
longitudinal vortex pair is generated. Experiments are
carried out for different combinations of actuator
frequency (ω) and maximum tip height (d). For each
combination, a two-component (streamwise and
spanwise) hot-wire anemometer is used to measure the
velocity distributions downstream from the actuator.
From the velocity distributions, the vorticity and
surface shear stress distributions are calculated and
phase averaged.

The longitudinal vortices generated by the
vortex generator convect high-speed fluid to the wall,
inducing a local high shear-stress streak (Fig. 8) on the
surface. As the actuator oscillates, it generates
perturbations in the flow which change the shear stress
intensity of the streak. As indicated by the
ensemble-average result shown in Fig. 8, the intensity of the
streak decreases as the flap actuator deflects away from
the surface, reaching a minimum at the maximum
actuator tip height (phase = π). As the actuator moves
back to the surface, the streak intensity returns to its
original level. To evaluate the net effect of the actuator
oscillation on the shear stress distribution, the net
shear stress coefficient, C_DN, defined as
$$ C_{DN} = \frac{1}{2z} \int_{-z}^{+z} \left[ C_D(z') - C_{D,vo}(z') \right] dz' $$

for different ω-d combinations is evaluated and
shown in Fig. 9. The C_D,vo in the equation is the
time-invariant shear stress coefficient associated with the
stationary high shear-stress streak induced by the
vortex generator. Defined this way, C_DN indicates a
shear stress increase if positive and a drag decrease if
negative. As shown in Fig. 9, higher drag reduction is
achieved at higher ω and higher d. In addition, similar
shear stress reduction results if the product of ω
and d is the same. Since the product of ω and d is a
measure of the transverse velocity of the actuator
flap, this result indicates that the amount of shear
stress reduction is directly related to the transport of
high-speed fluid away from the surface by the vertical
motion of the actuator.
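
The role of the frequency-height product as a velocity scale can be illustrated with a short sketch. It assumes that the tip height follows h(t) = (d/2)(1 - cos ωt) with ω = 2πf; the text does not give the waveform or the units behind the ωd values in Fig. 9, so the function names and operating points below are hypothetical.

```python
import math


def tip_velocity_scale(freq_hz, tip_height_mm):
    """Return omega*d in mm/s, taking omega = 2*pi*f (an assumed convention)."""
    omega = 2.0 * math.pi * freq_hz
    return omega * tip_height_mm


def peak_tip_velocity(freq_hz, tip_height_mm):
    """Peak vertical tip speed for the assumed motion h(t) = (d/2)*(1 - cos(omega*t))."""
    return 0.5 * tip_velocity_scale(freq_hz, tip_height_mm)


# Two hypothetical operating points with the same frequency-height product.
slow_and_tall = tip_velocity_scale(freq_hz=40.0, tip_height_mm=2.0)
fast_and_short = tip_velocity_scale(freq_hz=80.0, tip_height_mm=1.0)
print(slow_and_tall, fast_and_short)
```

Under this assumption, halving the tip height while doubling the frequency leaves the velocity scale unchanged, which is the kind of pairing for which the text reports similar shear stress reduction.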


Fig. 8. Surface shear stress map: contours of
ensemble-average dU/dy. (a) Vortex generator only,
(b) vortex generator and an oscillating micro actuator.


Fig. 9. Variation of the shear stress coefficient, C_DN,
with actuator frequency, ω, and maximum actuator tip
height, d. Each line is the result of one d, as indicated
at the end of the line. The solid circle (●) markers correspond
to an ωd of 80 and the solid square (■) markers correspond to
an ωd of 100.

A System for Shear Stress Reduction

Integrating microsensors, microelectronics
and microactuators on a chip is possible, yet is not a
trivial task. The transducers may need to be fabricated
at a much higher temperature than that of the circuits,
which means the diffusion of the doping material in
the circuits may affect their performance. The process
flow needs to be carefully planned to eliminate
cross-contamination possibilities. We have fabricated a
system for shear stress reduction (Fig. 10). On this 1
cm × 1 cm die, 18 micro shear stress sensors, 3 micro
flap actuators as well as circuits for logic, sensor driver
and actuator drivers are monolithically integrated. It
is not fully functional yet. The experience gained through
this exercise is extremely valuable for us to proceed
toward a working system.


Fig.  10.  A  system  for  shear  stress  reduction 

MACRO AERODYNAMIC DEVICES
CONTROLLED BY MICRO ACTUATORS


For controlling the motion of an aircraft with
a delta wing, we need to manipulate the vortices which
have a length scale of the wing. This is very different
from controlling the small near wall structures in the
surface shear stress control experiment. The problem
is to find a coupling between the micro actuators and
the wing-size flow structures. It is known that the
evolution of the vortex is sensitive to perturbations at
its origin. The leading edge vortices develop from
a thin boundary layer separated from the surface. We
can employ micro actuators with a displacement
comparable to the boundary layer thickness to
manipulate the separation process, and therefore
achieve control of the global motion by minute
perturbations.

Experimental  Setup 

An open-type wind tunnel was used for
aerodynamic tests. The test-section is 3 ft x 3 ft and
22 ft long. The maximum operating speed of this
wind tunnel is 45 m/s. The turbulence level of the
wind tunnel is about 0.6%.

A delta wing model with a sweep angle of
56.5° was used for this study. The delta wing model
has rounded leading edges. Micromachined actuators
and micro shear stress sensors were placed on a
cylindrical rod located at the leading edges of a delta
wing. The rod can be rotated such that the locations of
these transducers can be altered to cover the whole
leading edge area. Using micro shear stress sensors,
we can map the shear stress distribution on the leading
edge, and consequently determine the separation line.
We can also find the optimum location for the
micro actuators with this type of setup.

Control of Rolling, Pitching, and Yawing Moments

A delta wing with a sweep angle of 56.5° is
mounted on a six-component force/moment transducer
(AMTI, Inc.) which records forces and moments in all
three axes. First, we used actuators which extend from
the apex to the trailing edge, the A-T actuator. The
location of the actuator is indicated by the angle, θ,
which is measured from the lower surface of the wing
(Fig. 11). At different angles of attack (α), the
incremental torque generated by the miniature
actuator, M_r, was normalized by the torque produced
by the vortex lift, M_v, which is the product of the lift
generated by a single leading edge vortex and the
distance between the centroid of the half wing and the
center line. In this paper, M_v at α = 30° is used for
all the normalization.


α: angle of attack

θ: angle measured from the bottom surface

Fig. 11. Schematic representation of delta wing
system


The data for α = 25° are presented in Figs. 12-14.
In general, the curves of all three moments
collapse when they are normalized to M_v. Here, M_v is
proportional to the velocity to the second power.
Inertia of the wing is a function of weight distribution
and geometry, but is independent of velocity. The torque
generated by the miniature actuators therefore produces
a higher angular acceleration as the velocity increases.
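
The scaling argument in this paragraph can be written out as a short sketch. The symbols c (the normalized moment read from the figures), I (the moment of inertia about the axis in question), U (the free-stream velocity), k (a proportionality constant) and the angular acceleration written as a time derivative of Ω are introduced here only for illustration and do not appear in the original.

```latex
M_r = c\,M_v, \qquad M_v = k\,U^{2}
\quad\Longrightarrow\quad
\dot{\Omega} = \frac{M_r}{I} = \frac{c\,k}{I}\,U^{2}
```

With c and I fixed, doubling U would quadruple the angular acceleration available from the same actuator, under these assumptions.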

Figure 12 shows that the rolling moment has a
positive peak of 40% at around θ = 40°-50°, which has
been identified as the separation region in a previous
paper. Around θ = 80°-90°, a negative peak of 40%
was detected. Using this feature, we can obtain an
additional rolling moment by using two A-T actuators
placed on two leading edges, one at the positive peak
and the other one at the negative peak. The test results
show that the resultant rolling torque is equal to the sum of
the two effects. At AOA = 25°, the normalized rolling
moment can reach 70% using this combination.

The pattern of the pitching moment is
different from that of the rolling moment (Fig. 13). For θ
< 30°, the pitching moment is close to zero and has its
peak of 15% at θ = 50°-60°. If we use two-sided
actuation, about 30% of the pitching moment can be
achieved.

The yawing moment (Fig. 14) also has a
different profile from that of the rolling and pitching
moments. When the actuator is placed around the lower
surface, i.e. θ < 40°, only a -4% change of yawing
moment is found (at θ = 40°), while 8% is found at θ = 90°. About
12% of yawing moment can be obtained if two-sided
actuation is employed.


Fig. 12. Normalized rolling moment vs. actuator
location at AOA = 25°, A-T actuation

Fig. 13. Normalized pitching moment vs. actuator
location at AOA = 25°, A-T actuation

Fig. 14. Normalized yawing moment vs. actuator
location at AOA = 25°, A-T actuation

(Horizontal axis in Figs. 12-14: angle measured from bottom surface, in degrees.)


The actuator design which extends half-way
between the apex and trailing edge, the H-A-T
actuator, has also been tested. Two H-A-T actuators were
placed on both leading edges. The data show that about
30% of the normalized pitching moment and 8% of
the normalized yawing moment can be generated by
using this kind of actuation (Figs. 15-16). Moreover,
if two H-A-T actuators are positioned at the
corresponding location of each leading edge, the
actuation provides zero rolling moment and a finite
pitching moment. These tests suggest that extensive
possibilities of controlling aerodynamic moments exist.
In  other  words,  a  system  of  distributed  actuators  can  be 


Figure 15. Normalized pitching moment vs. Reynolds
number at AOA = 25°, two-sided H-A-T actuators

Figure 16. Normalized yawing moment vs. Reynolds
number at AOA = 25°, two-sided H-A-T actuators


Flight  Test  of  a  Scaled  Model  Airplane 

A one-seventh scale Mirage III airplane with
a rounded leading edge was built for a proof-of-concept
test. Miniature mechanical actuators (3 mm and 6 mm)
were mounted at different locations on the leading edge
to investigate the optimum location for torque control.
Five different locations, 0°, 40°, 60°, 90°, and 120°,
have been tested and video footage shows that 60° is
the most effective location. Video footage also shows
that several maneuver modes can be achieved by using
miniature actuators, including a 180° roll in 0.8 second
as well as swift turning and looping of the airplane.
No appreciable difference can be observed between the 3 mm
and 6 mm actuators, which suggests that the boundary
layer at the leading edge should be 3 mm thick or less for
this scale. Further reduction in the size of the actuators
will be tested. In the future, extensive experiments on
scaled aircraft will be carried out to investigate
moment control by micro actuators.

CONCLUDING  REMARKS 

MEMS technology can be used to fabricate
sensors and actuators in sizes which are one or more
orders of magnitude smaller than the ones made by
traditional techniques so that the temporal and spatial
resolutions can be greatly improved. Another property
of the MEMS sensor is that it can be monolithically
integrated with microelectronics. The MEMS airbag
sensor uses this advantage to increase its functionality
and lower the unit costs such that it occupies a major
share of the market in a short time. Using a large
number of actuators to control the light beam in a
distributed manner, the Texas Instruments DLP mirror
array introduces a completely new approach to image
projection.

These three features of MEMS transducers,

1. small physical size

2. ability to be integrated with microelectronics

3. distributed control by utilizing arrays of transducers,

can make many difficult flow control tasks possible.
The M³ system reported in this paper enables us to
directly perturb individual near-wall flow structures in
the turbulent boundary layer for better understanding
of the flow physics and possible reduction of the
surface shear stress. The delta wing experiment has
demonstrated that micro actuators can
influence the global aerodynamics.

MEMS is a revolutionary technology which
has facilitated and will continue to facilitate basic research and
make an impact on numerous industries. The experience
of the last few years in the flow control area shows
that:

1. the small physical size is ideal for sensor
applications. We have demonstrated the
airworthiness of the micro sensors by placing
them around the leading edge of a wing of an
F-15 which flew at M=1 speed. The sensors
are robust enough to fly without suffering structural
damage.

2. when microelectronics is used for real-time
control, a simple circuit is necessary to save
the processing time and real estate on the
chip. A neural network can be a useful choice
for satisfying these requirements, and its use
has been illustrated in a drag reduction
numerical experiment.

3. the force and displacement outputs required of
actuators in flow control tasks are usually
much higher than typical MEMS actuators
can provide. Special efforts to develop
actuators satisfying these requirements are
needed.

ABSTRACT

The  micromachining  technology  that  emerged  in  the  late  1980s  is  able  to  provide  micron-sized 
sensors  and  actuators.  These  micro  transducers  can  be  integrated  with  signal  conditioning  and 
processing circuitry to form micro-electro-mechanical systems (MEMS) that can perform real-time
distributed control. This capability opens up new territory for flow control research.
However,  because  of  the  large  surface-to-volume  ratio  in  micron-scale  configurations,  surface 
effects  dominate  the  fluid  flowing  through  these  miniature  mechanical  devices.  For  this  reason, 
we  need  to  re-examine  the  surface  forces  in  the  momentum  equation.  Because  of  the  small  size, 
gas  flows  experience  large  Knudsen  numbers  and,  therefore,  boundary  conditions  need  to  be 
modified.  In  addition  to  being  an  enabling  technology,  MEMS  also  provides  many  challenges  for 
fundamental  flow-science  research. 


1.  INTRODUCTION 

During the past decade, micromachining technology has become available to fabricate micron-sized
mechanical parts. Micromachines have made major impacts on many disciplines (e.g.,
biology, medicine, optics, and aerospace, mechanical and electrical engineering). In this article,
we will limit our discussion to transport phenomena, specifically emphasizing the fluid dynamics
issues. This emerging field not only provides miniature transducers for sensing and actuation in a
domain that we could not examine in the past, but also allows us to venture into a research area in
which the surface effects dominate most of the phenomena.

Figure 1 shows a scanning-electron-microscope (SEM) picture of an electrostatically driven
motor (Fan, Tai & Muller 1988a). This device signifies the beginning of the micromachine field.
A comb structure (Tang et al 1989) derived from the micro motor concept eventually evolved into
the airbag sensor, which reduces the damage caused by automobile collisions and is used now on
almost all American-made cars. During the development of the micro motor, it was found that
the frictional force between the rotor and the substrate is a function of the contact area. This
result departs from the traditional frictional law (i.e., f = μN), which says that the frictional force
is linearly proportional to the normal force, N, only. In the micro motor case, the surface forces
between the rotor and the substrate contribute to most of the frictional force. However, the
traditional frictional law describes situations with a dominating body force that does not depend on
the contact area. Deviations from the conventional wisdom are commonly found in the micro
world. This makes the micromachine field a new technology as well as a new scientific frontier.


Figure 1 A micro motor (from Fan, Tai & Muller 1988a). A piece of
human hair is shown in front of the motor to illustrate the minute size.

The  micromachining  process  uses  lithography  to  expose  the  designed  photo-resist  patterns;  the 
unwanted  portion  is  then  selectively  etched  away.  These  procedures  are  similar  to  those  used  in 
integrated  circuit  (IC)  fabrication  but  with  a  difference:  3-D  and  freestanding  structures  are 
common  features,  because  of  the  nature  of  mechanical  parts.  Several  manufacturing  technologies, 
such as bulk micromachining, surface micromachining, and LIGA (a German acronym for
Lithographie, Galvanoformung, and Abformung), have been developed to make various
micromachines. A brief introduction of these technologies can be found in a paper by Ho and Tai
(1996). For detailed information, readers are referred to several extensive articles (Petersen
1982, Seidel 1987, Ristic 1994).

Micromachines  have  several  unique  features.  First,  typical  micromachined  transducer  sizes  are  on 
the  order  of  100  microns,  which  can  be  one  or  more  orders  of  magnitude  smaller  than  traditional 
sensors  and  actuators.  The  drastic  reduction  in  inertia  due  to  these  smaller  sizes  means  a 
substantial  increase  in  the  frequency  response.  Second,  the  batch  processing,  which  is 
characteristic  of  IC  fabrication,  can  be  used  to  make  a  large  number  of  transducers  for  distributed 
sensing  and  actuation  over  a  wide  area.  This  capability  enables  us  to  sense  certain  flow 
characteristics  in  a  2-D  domain  and  to  perform  control  at  the  proper  locations.  Potential 
application  areas  include  the  manipulation  of  separation  over  a  smooth  contour  or  the  reduction 
of  surface  shear  stress  in  a  turbulent  boundary  layer.  Third,  micromachine  manufacturing 
technology  is  derived  from  (though  not  completely  compatible  with)  IC  fabrication,  so  it  is 
possible  to  integrate  the  IC  with  micro  transducers  to  provide  logic  capability.  The  integrated 
microelectronics  and  micromachines  constitute  the  micro-electro-mechanical  system,  which  is 
capable of executing sense-decision-actuation on a monolithic level.

In biomedical applications, fluid transport is commonly required in drug delivery and in chemical
and DNA analysis. When dealing with flow in configurations of microns or less, we have
observed many unexpected phenomena that are similar to the aforementioned experience of
frictional force between solid surfaces. Sir Eddington (1928) once said, "We used to think that if
we know one, we know two, because one and one are two. We are finding that we must learn a
great deal more about 'and.'" Indeed, the flows in macro and micro configurations are not quite
the same. The unique features in micromechanics are perhaps the most intriguing ones for
researchers in basic fluid mechanics. We still have a great deal of difficulty in understanding these
features, because not much is known about the complex surface effects that play major roles in
these events. The search for answers will excite researchers for years to come. In this paper, we
will first report and discuss the fundamental micro fluid-mechanics issues and then proceed to
review flow sensing and control using MEMS.

2.  SIZE  EFFECTS 

2.1  Ratio  between  Surface  Force  and  Body  Force 

The  length  scale  is  a  fundamental  quantity  that  dictates  the  type  of  forces  governing  the  physical 
phenomena.  Body  forces  are  scaled  to  the  third  power  of  the  length  scale.  Surface  forces  depend 
on  the  first  or  second  power  of  the  characteristic  length.  Because  of  the  difference  in  slopes,  the 
body  force  must  intersect  with  the  surface  force.  In  biological  studies  (Went  1968),  empirical 
observations  indicated  that  a  millimeter  is  the  approximate  order  of  the  demarcation  length.  The 
experiences  gathered  in  MEMS  also  show  that  surface  forces  dominate  in  sizes  smaller  than  a 
millimeter. For example, the friction experienced by the 100-micron-diameter micro motor (Fan,
Tai & Muller 1988a & b) must be caused mainly by the surface force, because the rotor started to
move  when  the  contact  area  between  the  rotor  and  the  substrate  was  reduced  by  placing  dimples 
on  the  lower  surface  of  the  rotor. 
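
The crossover between the two scalings can be illustrated with a minimal sketch. It takes surface tension as a representative surface force (linear in length) and weight as the body force (cubic in length), both for water; this choice of forces, the property values, and the function names are assumptions made for illustration and are not taken from the text.

```python
import math

GAMMA = 0.072   # surface tension of water against air near room temperature, N/m
RHO = 1000.0    # density of water, kg/m^3
G = 9.81        # gravitational acceleration, m/s^2


def surface_force(length_m):
    """Surface-tension force scale, linear in the length scale."""
    return GAMMA * length_m


def body_force(length_m):
    """Weight of a cube of water, cubic in the length scale."""
    return RHO * G * length_m ** 3


def crossover_length():
    """Length at which the two force scales are equal: sqrt(gamma / (rho * g))."""
    return math.sqrt(GAMMA / (RHO * G))


for length_m in (1e-2, 1e-3, 1e-4, 1e-6):
    ratio = surface_force(length_m) / body_force(length_m)
    print(f"L = {length_m:.0e} m  surface/body = {ratio:.3g}")
print(f"crossover at about {crossover_length() * 1e3:.1f} mm")
```

For these values the two forces balance at a few millimeters, the same order as the demarcation length quoted above, and the surface force exceeds the body force by several hundred times at 100 microns.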

2.2  Ratio  between  Device  and  Intrinsic  Length  Scales 

In addition to the large surface force, the large surface-to-volume ratio is another characteristic
inherent in small devices. This ratio is typically inversely proportional to the smaller length scale
of the cross section of the device, which is about one micron in surface micromachined devices.
Therefore, the surface-to-volume ratio is much larger in a micro device than in a macro device,
which accentuates the role of surface force as well as other surface effects in general.
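
As a rough illustration of this scaling, the sketch below computes the surface-to-volume ratio per unit length of a rectangular duct. The duct dimensions (a 40-micron by 1-micron micro channel and a 1 cm square macro duct) and the function name are hypothetical values chosen only to show the trend.

```python
def surface_to_volume(width_m, height_m):
    """Wetted perimeter divided by cross-sectional area of a rectangular duct, in 1/m."""
    return 2.0 * (width_m + height_m) / (width_m * height_m)


micro = surface_to_volume(40e-6, 1e-6)   # hypothetical surface-micromachined channel
macro = surface_to_volume(1e-2, 1e-2)    # hypothetical 1 cm x 1 cm duct
print(f"micro: {micro:.3g} 1/m, macro: {macro:.3g} 1/m, ratio: {micro / macro:.3g}")
```

When the width is much larger than the height, the result approaches 2/height, so the smaller cross-sectional dimension sets the ratio.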

In micro flows, the Reynolds number, which expresses the ratio between the inertial and
viscous forces, is typically very small. However, when gas is the working fluid, the size can be
small enough to further modify the viscous effect when the device length scale is on the order of
the mean free path. For large Knudsen-number flows, the flow velocity at the surface starts to
slip (Knudsen 1909, Kennard 1938); therefore, the viscous shear stress is much reduced. For
liquid flows, the distance between molecules is on the order of angstroms. The non-slip
condition has always been used as an empirical result. By using a molecular dynamics approach
(Koplik et al 1989, Koplik & Banavar 1995), the non-slip condition at the solid surface is
established in Couette and Poiseuille liquid flows. On the other hand, molecular ordering has
been observed and results in oscillatory density profiles in the vicinity of the wall, within a
layer a few molecular spacings thick. In the case of a moving contact line at the fluid/fluid/solid
interface, the non-slip condition needs to be relaxed (Dussan & Davis 1974). Typical
micromachined devices have a length scale much larger than the molecular spacing of simple
liquids. Hence, the non-slip boundary condition should hold in the absence of a moving contact
line.
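
The Knudsen number for a gas flow can be estimated with a short sketch. The helium mean free path is the value quoted in Section 4.1; the regime boundaries are commonly quoted approximate thresholds rather than values from this text, and the function names are hypothetical.

```python
def knudsen_number(mean_free_path_m, length_scale_m):
    """Kn = mean free path / characteristic device length."""
    return mean_free_path_m / length_scale_m


def flow_regime(kn):
    """Classify with commonly quoted, approximate thresholds (not from the source)."""
    if kn < 1e-3:
        return "continuum (no-slip)"
    if kn < 1e-1:
        return "slip flow"
    if kn < 10.0:
        return "transitional"
    return "free molecular"


HELIUM_MEAN_FREE_PATH_M = 2e-7   # value quoted in Section 4.1 for laboratory conditions

for height_m in (1e-3, 1e-5, 1e-6):
    kn = knudsen_number(HELIUM_MEAN_FREE_PATH_M, height_m)
    print(f"h = {height_m:.0e} m  Kn = {kn:.1e}  {flow_regime(kn)}")
```

For a one-micron channel this gives a Knudsen number of 0.2, well outside the range where the no-slip condition is normally assumed.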

In  other  situations,  the  bulk  flow  is  modified  instead  of  the  boundary  condition.  For  example, 
most  solid  surfaces  have  electrostatic  surface  charges,  which  can  attract  ions  in  liquid  flows  to 
form  an  electric  double  layer  (EDL)  (see  Section  3.2).  The  thickness  of  the  EDL  varies  from  a 
few  nm  to  100s  of  nm  (Hunter  1981),  which  can  be  comparable  to  the  order  of  micro-flow  length 
scale.  In  these  cases,  the  bulk  flow  can  be  affected  by  this  electrically  charged  layer  (Mohiuddin 
Mala  et  al  1996). 

3.  SURFACE  FORCES 

For  fluid  flows  in  MEMS,  new  phenomena  arise  because  of  certain  surface  forces  that  are  usually 
ignored  in  macro  scales.  Here,  a  brief  survey  is  given  on  several  kinds  of  surface  forces 
(Israelachvili  1991).  Before  the  discussion  of  some  seemingly  different  surface  forces,  it  is 
important  to  know  that  these  forces  originate  from  intermolecular  forces.  Moreover,  even  though 
basic intermolecular forces are short range (< 1 nm) in nature, they can cumulatively lead to very
long-range (> 0.1 μm) effects (e.g., surface-tension effects in liquids). Another important point is
that all intermolecular forces are fundamentally electrostatic (coulombic). This is established by
the Hellmann-Feynman theorem: once the spatial electron distribution is determined from the
Schrödinger equation, all intermolecular forces can then be calculated using classical
electrostatics. However, in practice this cannot always be done, and empirical or semi-empirical
laws of forces are still useful. Therefore, in the following, we treat the surface forces
differently, even though they are quantum mechanically the same in origin.

3.1 The van der Waals Forces

The van der Waals forces are the weakest among all the forces, but they are important because
they are always present. The van der Waals forces are short range in nature but, in cases where
large molecules or surfaces are involved, they can produce an effect longer than 0.1 μm. In
general, the van der Waals forces have three parts: orientation force, induction force, and
dispersion force. All have an interaction free energy that varies with the inverse sixth power of
the distance (1/r^6) and are, hence, short range. The orientation force is the dipole-dipole
interaction force between polar molecules. The induction force arises from the interaction
between a polar molecule and a non-polar molecule. The permanent dipole of the polar molecule
induces a weak dipole in the non-polar molecule and then produces a dipole/induced-dipole
interaction force. The dispersion force is then the induced-dipole/induced-dipole interaction
force. Interestingly, the dispersion forces act on all atoms and molecules even when they are
totally neutral, as are those of helium and oxygen. The source of the dispersion force between two
non-polar molecules is the following: Although the averaged dipole moment of a non-polar
molecule is zero, at any instant there exists a finite dipole moment depending on the exact position
of the electrons around its nucleus. This instantaneous dipole moment can then generate an
interaction force with nearby molecules.

Altogether,  the  van  der  Waals  forces  play  an  important  role  in  many  macroscopic  phenomena 
(e.g.,  adhesion,  surface  tension,  physical  adsorption,  wetting  of  surfaces,  properties  of  thin  films, 
and  behaviors  of  condensed  proteins  and  polymers).  In  MEMS,  the  van  der  Waals  forces  can  have 
significant  effects  in  structures  with  large  surface-to-volume  ratios  (e.g.,  long  and  thin  polysilicon 
beams  (Mastrangelo  &  Hsu  1992)  and  large-and-thin  comb-drive  structures  (Tang  et  al  1989)) 
whenever  they  are  in  contact  with  another  surface.  Stiction  or  adhesion  of  the  structure  to  the 
substrate  can  often  be  observed  as  a  major  problem  in  the  operation  of  these  structures. 
Nevertheless, the van der Waals forces between two contacting surfaces are in many cases hard to
distinguish from electrostatic (coulombic) forces, which are discussed in the next
section.

3.2  Electrostatic  Force 

Electrostatic, or coulombic, force is present between charged molecules or particles. The force
has an inverse-square dependence on the distance, 1/r^2, so it is rather long range when compared
to the van der Waals forces. In MEMS devices, the electrostatic force can have a significant effect
even up to 10 μm away and becomes more important when lengths are less than 0.1 μm. One can
always produce an electrostatic force by providing an electrical potential difference between two
electrodes. However, problems due to electrostatic force in MEMS often occur because of rather
uncontrollable surface-trapped charges. In fact, any surface is likely to carry some charge, because
of broken bonds and surface charge traps. In the case where the surface is a good insulator, such
as with SiO2, trapped charges can induce very high voltages, from a few hundred to a few thousand volts
(Wolf 1990).

For charged surfaces in liquids (e.g., water), new phenomena happen mainly as a result of charge
redistribution in the liquid. Basically, the final surface charge is balanced by counterions in the
liquid with an equal but opposite total charge. The surface electrical potential attracts counterions
to the wall and forms a thin (< 1 nm) layer of immobile ions. Outside of this layer, the distribution
of the counterions in the liquid mainly follows an exponential decay away from the
surface; this region is called the diffuse electric double layer (EDL). The EDL has a characteristic
length, the Debye length, which depends inversely on the square root of the ion concentration in
the liquid. For example, in pure water the Debye length is about 1 μm; in a 1 M NaCl solution,
the Debye length is only 0.3 nm. Inside the EDL, a very large electrostatic force then exists. This
may cause a behavior change in the fluid flow if the double layer thickness is significant compared
to the flow field size (Mohiuddin Mala et al 1996). This is especially true in dilute solutions where
the Debye length is large.
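
The two Debye lengths quoted above can be checked with a short sketch based on the standard expression for a symmetric 1:1 electrolyte, λ_D = sqrt(ε_r ε_0 k_B T / (2 n e²)), where n is the number density of each ion species. The temperature (25 °C), the relative permittivity of water (78.4), and the function name are assumptions for illustration; the text gives only the resulting lengths.

```python
import math

EPS0 = 8.854e-12      # vacuum permittivity, F/m
K_B = 1.381e-23       # Boltzmann constant, J/K
E_CHARGE = 1.602e-19  # elementary charge, C
N_A = 6.022e23        # Avogadro constant, 1/mol


def debye_length(conc_molar, temp_k=298.15, eps_r=78.4):
    """Debye length (m) of a symmetric 1:1 electrolyte at molar concentration conc_molar."""
    ions_per_m3 = conc_molar * 1000.0 * N_A   # mol/L -> ions of each sign per m^3
    return math.sqrt(eps_r * EPS0 * K_B * temp_k /
                     (2.0 * ions_per_m3 * E_CHARGE ** 2))


pure_water = debye_length(1e-7)   # only the self-ionized H+ and OH- ions
salt_water = debye_length(1.0)    # 1 M NaCl
print(f"pure water: {pure_water * 1e6:.2f} um, 1 M NaCl: {salt_water * 1e9:.2f} nm")
```

Under these assumptions the results are close to 1 μm and 0.3 nm, and a hundredfold dilution lengthens the Debye length tenfold, consistent with the inverse square-root dependence stated above.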

3.3  Steric  Forces 

This is a special case involving chain molecules (e.g., polymers) attached at the surface on one
end with the other end dangling into the solution (liquid for most of the cases) where they are
mobile. A different class of forces, known as steric forces, arises whenever another molecule or
surface approaches and is a result of an entropy change caused by the confined chain molecules.
The complex molecules can produce complex interactions, and steric forces can be either
attractive or repulsive. They can be rather long range (> 0.1 μm), and they are important when a
fluid flow has a significant number of long-chain molecules.


4.  FLOWS  IN  MICRO  CONFIGURATIONS 

Fluids driven by pumps flowing through channels and valves are generic configurations in
biomedical analytical systems. When the sizes of these devices are in the micron range, the
measured data show different behaviors from those expected in larger devices. The exact physical
mechanisms are not known, although the surface forces, which were not considered in classical
analyses, are believed to be responsible for these interesting phenomena. This provides new
research opportunities. In this review, we will limit the discussion to simple fluids, which have
small molecules. More complex fluids (e.g., non-Newtonian or multi-phase fluids) are commonly
used in biomedical systems. Much richer findings are expected in the future.

4.1  Gas  Flows  in  Micro  Channels 

Flow through a straight channel is one of the simplest but most common configurations in micro
fluidic systems. Mass flow rates in small channels with diameters of about 30 microns were
measured by Knudsen (1909) while studying the non-slip/slip boundary condition. Recent
interest has been triggered by micromachine activities (Pfahler et al 1990), which include applications
for transporting fluids in biomedical diagnosis and electronic device cooling (Tuckermann &
Pease 1982, Joo et al 1995). Helium is a common gas used in most experiments, because it has a
large mean free path (about 2 x 10^-7 m under laboratory conditions). The Knudsen number based
on a channel height of one micron is 0.2. A micro channel with integrated micro pressure sensors
(Fig. 2) was fabricated to study the flow field (Liu et al 1993b). This channel has a cross section
of 1.2 microns x 40 microns. Slip flow is observed, and the measured mass flow rate (Pfahler et al
1991, Pong et al 1994, Arkilic et al 1995, Harley et al 1995, Liu et al 1995, Shih et al 1995, Shih
et al 1996) is higher than that based on the non-slip boundary condition (Fig. 3). For other gases
(e.g., nitrogen, oxygen, and nitrous oxide), the Knudsen number is about a factor of four smaller,
but surface slip still exists. The mass flow rate can be calculated from the Navier-Stokes equation
with a slip boundary condition (Kennard 1938, Beskok & Karniadakis 1992 & 1993, Arkilic &
Breuer 1993). An accommodation constant is introduced to represent the tangential momentum
transfer between the impinging molecules and the wall. The value of the constant should be < 1.
However, the predicted mass flow rate is sensitive to the accommodation constant (Fig. 3),
which actually functions as a matching coefficient. Direct simulation Monte Carlo (DSMC)
calculations have been carried out by many investigators (Oh et al 1995, Piekos & Breuer 1995 &
1996, Beskok et al 1996, Oran et al 1998). The mean streamwise velocity in the micro channel is
typically in the very low subsonic range (< 1 m/s), which can be several orders of magnitude
smaller than the molecular thermal velocity of 1000 m/s
(Oh et al 1995). Obtaining a converged solution is a challenge for very low Mach-number
flows.
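
The Knudsen number and the extra mass flow caused by wall slip can be estimated with a few lines of code. The sketch below assumes the commonly used first-order slip model for isothermal flow in a long, wide channel, in which the slip-to-non-slip mass-flow ratio is 1 + 12 s Kn_o / (P + 1), with s = (2 - sigma)/sigma, Kn_o the outlet Knudsen number, and P the inlet-to-outlet pressure ratio. The function names, the helium mean free path of 2e-7 m (chosen to be consistent with the quoted Knudsen number of 0.2 for a one-micron channel), and the pressure ratio are assumptions for illustration, not values taken from the cited measurements.

```python
def knudsen_number(mean_free_path, channel_height):
    """Knudsen number: molecular mean free path over the channel height."""
    return mean_free_path / channel_height


def slip_to_noslip_mass_flow_ratio(kn_outlet, pressure_ratio, sigma=1.0):
    """Ratio of slip to non-slip mass flow rate in a long, wide channel.

    Assumes isothermal flow with a first-order slip boundary condition.
    kn_outlet is the Knudsen number at the outlet, pressure_ratio is the
    inlet/outlet pressure ratio, and sigma is the accommodation constant.
    """
    if not 0.0 < sigma <= 1.0:
        raise ValueError("accommodation constant must be in (0, 1]")
    slip_coeff = (2.0 - sigma) / sigma
    return 1.0 + 12.0 * slip_coeff * kn_outlet / (pressure_ratio + 1.0)


if __name__ == "__main__":
    kn = knudsen_number(2e-7, 1e-6)  # helium in a one-micron channel
    print(f"Kn = {kn:.2f}")
    # The predicted gain is noticeably different for different sigma values.
    for sigma in (1.0, 0.9, 0.8):
        ratio = slip_to_noslip_mass_flow_ratio(kn, 2.0, sigma)
        print(f"sigma = {sigma:.1f}: slip/non-slip mass flow = {ratio:.3f}")
```

In this model the predicted mass flow changes appreciably when the accommodation constant is varied, which is why that constant ends up acting as a fitting parameter.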


Figure 2 A micro channel system with integrated micro pressure sensors (Pong et al 1994).


Figure 3 Mass flow rate and pressure drop of helium in a micro channel (Shih et al 1996).

In the micro channel, high pressure drops are observed. This is because of the small transverse
dimension, which causes high viscous dissipation. A drop of a few atmospheres in pressure over
several mm is common (Pong et al 1994, Shih et al 1995). The density of the gas can change to
such a large extent that the pressure does not decrease linearly with streamwise distance as in
typical creeping flows. Rather, the compressibility effect causes the pressure to decrease at a
slower rate. On the other hand, the rarefaction effect due to the high Knudsen number works
against the compressibility and pushes the pressure towards the linear distribution (Arkilic & Breuer
1993). The two effects are not equal, so the net result is a non-linear pressure distribution. Since
the pressure distribution is not sensitive to the accommodation constant, it turns out to be a useful
property for examining the analytical results. When the accommodation constant is varied, no
appreciable change in the predicted pressure distribution can be observed. The measured pressure
distributions along the channel are plotted against the theoretical prediction. The measured
pressure distribution agrees well with the analytical result.
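
The competition between compressibility and rarefaction can be illustrated with the first-order slip model for isothermal flow in a long channel. In that model, P^2 + 12 s Kn_o P varies linearly with streamwise position, where P is the pressure normalized by the outlet pressure, s = (2 - sigma)/sigma, and Kn_o is the outlet Knudsen number. The sketch below solves this relation for P; the function name and the numerical values are assumptions for illustration and are not the measured conditions.

```python
import math


def pressure_distribution(x, pressure_ratio, kn_outlet, sigma=1.0):
    """Pressure normalized by outlet pressure at fractional position x.

    x runs from 0 (inlet) to 1 (outlet). Assumes isothermal flow in a
    long channel with a first-order slip boundary condition; sigma is
    the accommodation constant.
    """
    a = 6.0 * (2.0 - sigma) / sigma * kn_outlet
    # Total drop of the quantity P^2 + 2 a P between inlet and outlet
    drop = (pressure_ratio ** 2 - 1.0) + 2.0 * a * (pressure_ratio - 1.0)
    return -a + math.sqrt((a + pressure_ratio) ** 2 - drop * x)


if __name__ == "__main__":
    ratio = 3.0
    linear_mid = 0.5 * (ratio + 1.0)
    print(f"linear profile at mid-channel: {linear_mid:.3f}")
    for kn in (0.0, 0.05, 0.2):
        for sigma in (1.0, 0.8):
            p_mid = pressure_distribution(0.5, ratio, kn, sigma)
            print(f"Kn_o = {kn:.2f}, sigma = {sigma:.1f}: P(0.5) = {p_mid:.3f}")
```

With Kn_o = 0 the profile is the purely compressible one, which lies above the linear profile; increasing Kn_o moves it back towards linear, and changing sigma from 1.0 to 0.8 shifts the mid-channel pressure by only about one percent or less in this example.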


Figure  4  Pressure  distribution  of  nitrous  oxide  in  a  micro  channel 


4.2  Liquid  Flows  in  Micro  Channels 

Even though the non-slip boundary condition for simple liquids with small molecules is established
by experimental studies and by molecular dynamics simulation (Koplik 1989), the study of liquid flow
through micro channels is not a routine process. In fact, it seems to be an even richer problem than gas
flow. For liquid flows through capillary tubes (Migun & Prokhorenko 1987) or micromachined
channels (Pfahler et al 1990, 1991), the measured flow rates and pressure drops across the
channel were compared with the Stokes flow solution, with viscosity as the matching constant.
The size effects are apparent in these results, and the value of viscosity deviates from the
conventional value for micron-size channels. The molecular structure of the liquid also affects the
flow. Molecules with no electrical charge can have a dipole configuration (e.g., water). Pfahler et
al (1991) found that the value of the viscosity of polar isopropanol decreases from the nominal
value for a channel height smaller than 40 microns and reaches an asymptotic value at a channel
height of about 10 microns (Fig. 5). The vertical axis is the ratio between the apparent viscosity
and nominal viscosity. It is reasonable that the size effect
becomes more pronounced for a narrow channel. However, it is interesting to note that the
apparent viscosity is lower, not higher, in the narrower channel. The apparent viscosity actually
represents the integral effects of the surface forces. More definitive experiments are needed to
identify the role of specific surface effects. For example, when the surface-to-volume ratio gets
large, will the surface viscous force also be a function of the surface property (i.e., hydrophobic
or hydrophilic)? For non-polar silicone oil, the data (Fig. 5) do not support a clear trend of
viscosity variation in this range of channel size.
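
Using viscosity as the matching constant amounts to inverting the Stokes-flow relation between flow rate and pressure drop. The sketch below uses the parallel-plate form Q = W H^3 dP / (12 mu L), which assumes a channel much wider than it is high; the actual channel cross sections in the cited experiments may differ, and the function names and numbers are hypothetical.

```python
def poiseuille_flow_rate(viscosity, pressure_drop, height, width, length):
    """Volume flow rate (m^3/s) between wide parallel plates (width >> height)."""
    return width * height ** 3 * pressure_drop / (12.0 * viscosity * length)


def apparent_viscosity(flow_rate, pressure_drop, height, width, length):
    """Viscosity (Pa s) that makes the parallel-plate solution match the data."""
    return width * height ** 3 * pressure_drop / (12.0 * flow_rate * length)


if __name__ == "__main__":
    nominal = 2.0e-3                       # hypothetical nominal viscosity, Pa s
    geometry = dict(height=10e-6, width=100e-6, length=1e-2)
    predicted = poiseuille_flow_rate(nominal, 1e5, **geometry)
    measured = 1.2 * predicted             # hypothetical: 20% more flow than predicted
    ratio = apparent_viscosity(measured, 1e5, **geometry) / nominal
    print(f"apparent/nominal viscosity = {ratio:.3f}")
```

A measured flow rate higher than the prediction maps to an apparent viscosity below the nominal value, which is the sense of the deviation reported for the polar liquid.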


Figure  5  Size  effect  on  liquid  flow  in  channels  (Pfahler  et  al  1991). 

4.3  Diffusion  in  Micro  Channels 

The sacrificial etching technique (Nathanson 1967, Preston 1972, Howe & Muller 1983) is a
fundamental technique used in surface micromachining for producing a free-standing structure,
such as the diaphragm above a cavity or cantilever beam. The micro channel shown in Fig. 2 was
also fabricated by this method. The sacrificial etching process for making that channel was to
deposit a thin line of sacrificial material, in this case phosphosilicate glass (PSG), after which a
structural layer of silicon nitride was placed on top. Hydrofluoric acid (HF) was used as an
etchant to remove the PSG from one end of this thin line. When the PSG was etched away, a
long micro channel was formed (Monk et al 1993, Liu et al 1993a). The channel in Fig. 2 is 1.2
microns high and 4000 microns long.

This process is an important micromachine manufacturing technique, and it also motivates us to
examine mass diffusion in micro geometries. The HF diffuses from the reservoir, which is located
at one end of the micro channel, towards the PSG/HF interface. The rate at which the solid PSG
dissolves into the HF acid is a function of the reaction rate and the acid concentration. The
concentration of HF at the moving interface is dictated by the diffusion process and the reaction
rate. Basically, this phenomenon is governed by a 1-D diffusion equation with an unknown
boundary condition. It is similar to the Deal-Grove problem, but the reaction rate changes its
power dependence on the acid concentration (Judge 1971) in the present case. Nevertheless, a
closed-form analytical solution can be obtained (Liu et al 1993a), and it matches well with the
measured data (Fig. 6). On the other hand, a size effect is observed. The etch rate is not constant
during the etching; it decreases as etching time increases. During the whole etching period, the etch
rate of a thicker channel is always higher than that of a thinner channel (Liu et al 1993a). The etch
rate at the beginning of the etching process is plotted in Fig. 7, and it decreases almost linearly
with the channel thickness over the tested range from 1.2 microns down to 0.25 microns. This is not
expected from the 1-D diffusion analysis. There are many types of ions in the liquid phase. It is
possible that the thickness of the EDL in this case is on the order of the thickness of the micro
channel. The F⁻ ions and HF₂⁻ ions are responsible for removing the PSG, and their diffusion is
very likely to be retarded by the electric double layer.
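
The slowing of the etch front with time can be illustrated with a simplified Deal-Grove-type model. This is not the solution of Liu et al (1993a), which uses a reaction rate with a changing power dependence on concentration; the sketch below substitutes a first-order surface reaction so that the front position x satisfies x^2/(2D) + x/k = C0 t / n. The symbols D (diffusivity), k (reaction rate constant), C0 (reservoir concentration), n (etchant consumed per unit volume of PSG), and all numerical values are hypothetical.

```python
import math


def etch_front_position(t, diffusivity, rate_const, c_bulk, n_consumed):
    """Distance (m) of the etch front from the channel opening at time t (s).

    Closed-form root of x**2/(2 D) + x/k = c_bulk * t / n_consumed, i.e.
    quasi-steady 1-D diffusion feeding a first-order surface reaction.
    """
    alpha = 2.0 * rate_const ** 2 * c_bulk * t / (n_consumed * diffusivity)
    return diffusivity / rate_const * (math.sqrt(1.0 + alpha) - 1.0)


def etch_rate(t, diffusivity, rate_const, c_bulk, n_consumed):
    """Front velocity (m/s): etchant flux divided by etchant needed per volume."""
    x = etch_front_position(t, diffusivity, rate_const, c_bulk, n_consumed)
    flux = c_bulk / (x / diffusivity + 1.0 / rate_const)
    return flux / n_consumed


if __name__ == "__main__":
    # Hypothetical parameters in SI units (m^2/s, m/s, mol/m^3, mol/m^3)
    params = dict(diffusivity=1e-9, rate_const=1e-6, c_bulk=1e4, n_consumed=1e5)
    for t in (10.0, 100.0, 1000.0, 10000.0):
        x = etch_front_position(t, **params)
        v = etch_rate(t, **params)
        print(f"t = {t:7.0f} s: front = {x * 1e6:8.2f} um, rate = {v * 1e6:.4f} um/s")
```

The model reproduces the decreasing etch rate as the diffusion path lengthens, but it contains no dependence on channel thickness, so it cannot account for the size effect described above.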


Figure 6 The progress of the etching front as a function of time for various HF concentrations
(Liu et al 1993a). ■ - 49%, ♦ - 38.2%, A - 26.5%, “ - 13.8%, x - 7.2%, • - 3.6%. Data
points are plotted against predicted curves.

Figure 7 Size effect as a function of etching rate.


4.4 Flow Through Micro Nozzles

Flow through a nozzle experiences viscous dissipation. In the case of Stokes flow, the pressure
drop increases with decreasing Reynolds number, which is a well-established result. A recent
experiment (Hasegawa et al 1997) shows that the predicted pressure drop underestimates the
measured value when the size of the nozzle is smaller than 35 microns (Fig. 8). The measured
pressure drop can be four to five times higher than the predicted value for water flow through an
8.8-micron nozzle. When non-polar silicone oil was used, the excess pressure drops were lower than
those of water, which has polar molecules. The dependence of the measured flow property on the fluids
used in the test is a common feature in micro flows whenever a size effect is observed. It is
interesting to note that the difference between polar and non-polar materials, which has been
reported in micro channel flow (Pfahler et al 1991), is observed again in this flow.

It has been reported that the EDL retards the flow. This causes an apparent viscosity that is
higher than the nominal viscosity (Mohiuddin Mala, Li & Dale 1996). This is the opposite of the
results observed in the micro-channel liquid-flow data (Pfahler et al 1991) but follows the trends
reported in the micro-nozzle flow experiment (Hasegawa et al 1997). In this nozzle experiment,
the ion content in the water was varied by adding NaCl in order to examine the effect of the EDL.
No discernible difference in the pressure measurements was reported by the authors.
Unfortunately, distilled water rather than de-ionized water was used before the NaCl was added.
Water is known to pick up ions easily, but the level of the ion content in their distilled water is
unknown. Nevertheless, the excess pressure drop across the micro nozzle provides additional
evidence for the size effect in the micro flows.

Figure 8 Size effect of pressure drop across micro nozzles as a function of Re number
(Hasegawa et al 1997). (o - 8.8 µm, □ - 9.2 µm, A - 13 µm, 0 - 27 µm, + - 35 µm, x - 109 µm
numerical simulation)


4.5  Air  Damping  in  MEMS 

Mechanical or electromechanical resonant devices have been widely used in oscillators or filters,
and operational theories for these devices have been established for large apparatus. As MEMS
technology develops, these resonators are becoming smaller. Typically, these devices are 1 to 2
µm thick and about 100 x 100 µm² in area. Some important milestones for these kinds of devices
are the laterally driven comb-drive polysilicon micro resonators (Tang et al 1989), Analog
Devices' integrated polysilicon ADXL-50 accelerometers (Analog Device 1991), flexural
polysilicon micro gyros (Juneau & Pisano 1996), and the cascade 300-kHz micromechanical
filters (Wang & Nguyen 1997). In these cases, the micro flows are shear-driven (Beskok et al
1996) instead of pressure-driven, as in the channel and nozzle flows.

Air damping in microstructures is an under-explored but important issue, because it directly
influences the quality (Q) factor of the devices. For example, it has been shown (Nguyen 1995)
that the Q factor of a polysilicon resonator can be as high as 100,000 in a vacuum but drops to
100 in atmospheric air. This illustrates that air damping may be the most important factor
when compared to other effects like thermal vibration and fatigue. The strong air damping is due
to the dramatic increase in surface-to-volume ratio.

In order to systematically investigate air damping, the air pressure is divided into three regions
(Newell 1968). In the first region, where the pressure is in low vacuum, air damping is negligible
when compared to the intrinsic damping (internal friction) within the resonator. Experimentally,
this region happens roughly below 10-100 Pa for micro-resonators. The damping in this region is
largely dependent on the surface-to-volume ratio, the surface quality (the surface may be more
lossy than the bulk material), and the material. In the second region, momentum transfer between
the resonator and individual air molecules dominates the damping. Here, there is little or no
interaction between air molecules, and a simple model has been derived based on the assumption
that the rate of momentum transfer is proportional to the difference in velocity between the air
molecules and the resonators (Christian 1966). The result is that the Q factor is inversely
proportional to the ambient pressure (Q ∝ 1/p). This region typically ends around 10 Pa. In the
third region, the pressure is high enough that air molecules do interact with each other, and
viscous damping dominates the behavior of the resonators. In this region, the Q factor is inversely
proportional to the fluid viscosity (Q ∝ 1/µ). The boundary condition must change from slip to
non-slip flow when the surrounding pressure increases. What is the Knudsen number for this
change in the boundary condition? Do other surface forces play a role in resonator performance?
There is not enough experimental evidence to answer these questions yet. When the viscous
damping of laterally driven microstructures was studied (Zhang & Tang 1994), it was found that
the simple Couette and Stokes flow model can significantly underestimate the damping in
microstructures, and that this discrepancy is larger for smaller resonators. Most
microstructures have a complex configuration, and features such as the comb structure and the
surrounding walls can also play an important role. The detailed geometry of the device (e.g., the
resonator's thickness and edges) needs to be considered for more accurate modeling. An example
illustrating the importance of the geometry details is the dynamic response of a thin squeeze-film.
In a numerical simulation of a micro accelerometer (Yang & Senturia 1996), the squeeze-film in a
low air-pressure environment has a frequency response about four orders of magnitude better than
at atmospheric pressure conditions. When the geometry is slightly altered by placing small
etching holes through the accelerometer film, the flat portion of the frequency response is
extended by three decades from that of a solid film (Fig. 9).


Figure  9  The  dynamic  response  of  a  thin  squeeze-film.  (Yang  and  Senturia  1996) 

For some MEMS devices, the displacement can be much larger than the transverse dimension,
and other interesting air-damping problems arise. A flap device (Miller et al 1996) driven by an
electro-magnetic field has an off-plane displacement a thousand times larger than the thickness of
the flap. The measured Q factors depend on the magnitude of the displacement. It is believed that
the non-linear viscous damping contributes to this phenomenon. This, again, is a rather
unexplored issue.

4.6  Micro  Flow  Diagnosis 

Measuring  flow  properties  in  a  micro  configuration  is  a  challenging  task,  because  the  sensors  need 
to  be  much  smaller  than  the  size  of  the  device  under  study.  In  addition,  the  momentum  and  the 
energy of the flow are very small. For example, the kinetic energy flux in the channel with helium
flow  (Fig.  2)  is  about  5.4  x  J/s  for  inlet  pressure  at  20  psig.  Therefore,  only  an  extremely 
small  amount  of  momentum  and  energy  exchange  between  flow  and  sensor  is  allowed  so  as  to  not 
alter  the  flow.  For  these  reasons,  a  very  limited  number  of  sensors  have  been  developed  for  micro 
flow  measurements.  The  micro  channel  with  integrated  pressure  sensors  (Fig.  2)  is  one  example. 
A very small duct (0.25-micron high and one-micron wide) is used to connect not-very-small
(compared to the size of the channel) micro sensors to the channel for increased spatial resolution.
Time-averaged pressure can be obtained by this arrangement. If unsteady pressure data are
needed, questions about the calibration procedures and even the physical meaning of the data may
arise. Furthermore, even though the pressure sensor has a high frequency response, we do not
know the constraints that the small duct imposes on the unsteady pressure field. For temperature
measurement,  thermocouples  of  4  microns  x  4  microns  have  been  developed  for  measuring 
surface  temperature  along  a  20-micron  channel  (Fig.  10,  Zohar  et  al  1996).  In  high  Knudsen- 
number  flows,  a  temperature  jump  can  occur  between  the  wall  and  the  bulk  flow.  Fabricating  an 
in-flow  temperature  sensor  without  introducing  large  disturbances  to  the  micro  flow  is  not  a 
trivial  task. 


Figure  10  The  in-situ  temperature  sensor  in  a  micro  channel  (Zohar  et  al  1996). 

Flow visualization has been proven to be a very useful technique in macro fluid-dynamics
research, and extensive effort has been expended on developing methods for visualizing flow in
micro channels and valves (Lanzillotto et al 1996). The wavelength of visible light is not short
enough for this purpose, so X rays (with wavelengths of 0.62 angstrom), generated by a
synchrotron, are used. An advantage is that they can be utilized for visualizing flows behind
certain silicon structures, such as polysilicon, which are not transparent to visible light. Figure 11
shows the motion of a water/air interface during a filling process inside a 100-micron-deep and
140-micron-wide channel with a triangular cross section. Water climbs along the sharp corners of
the triangle at a rate so fast that it is beyond the resolution limit of the current technique. The
meniscus surface in the middle of the channel is left far behind the corner interface. This
phenomenon is caused by the large surface-tension force associated with the extremely small
corner curvature of the micro channel.


Figure  11  Flow  visualization  by  X  ray  in  a  micro  channel  (Lanzillotto  et  al  1996). 

5.  FLOW  TRANSDUCERS 

A typical micro transducer is about 100 microns in size, which is at least one order of magnitude
smaller than traditional transducers. The improvement in spatial resolution comes naturally with
the smaller size. In addition, the significant reduction of inertial mass and thermal capacity can
raise the frequency response well above the audible range (> 1 MHz for a micro hot wire
(Jiang et al 1994)). The
lithographic process and the small size enable us to use 100, 1000, or even more transducers for
sensing and/or controlling a phenomenon that has distributed features, such as the high speed
streaks in a turbulent boundary layer. However, a few problems are associated with using multiple
transducers. Traditionally, the leads of each transducer are connected to an external
signal-conditioning instrument. Handling hundreds or thousands of signal paths is tedious and usually
prohibitive. In addition, these leads occupy a large portion of the precious surface area on the
chip. For example, in the surface shear-stress sensor array containing 85 sensors (Jiang et al 1996),
the sensors take only about 1% of the area, while the leads occupy about 50% of the surface. Another
problem is the limitation in signal bandwidth. Depending on the number of transducers used, we
may sometimes not be able to take full advantage of the high-frequency response. However,
these problems can be alleviated by integrating the signal conditioning, and even the decision
circuits, with the micro transducers in such a way that the leads can be connected locally,
thereby significantly reducing the required surface area. The output and input then can be only a
few wires. Furthermore, the local circuit will only process the output from one or a few
transducers, so bandwidth limitation will no longer be a problem. However, the signal processing
circuitry still needs to be kept simple. It is not feasible to integrate a powerful microprocessor
with each sensor-actuator pair in a large transducer array. Neural networks are a feasible
approach, and numerical experiments (Lee et al 1997) show successful examples of using simple
neural-network-based processing for controlling transitional or turbulent flows. Another issue is
the energy expenditure of the transducers. In the case of flow control, we usually need to have a
net energy saving for the whole system. Typically, the energy requirement for a micro flow
sensor can be kept around a few milliwatts, but the energy expended for micro actuators is usually
much higher. Improvements in this area are clearly needed.

5.1 Micro Flow Sensor

5.1.1 HOT-WIRE ANEMOMETER Micro-machined polysilicon hot wires (Jiang et al 1994)
can have a cross section as small as 0.5 x 0.5 µm² and a length from one to several hundred
microns (Fig. 12), and their use can achieve a significant increase in spatial resolution.
There are many other advantages of using polysilicon. One is that the temperature coefficient of
resistance (TCR) of polysilicon can be varied from -2 %/°C to 0.2 %/°C by using a range of doping
concentration  of  boron  or  phosphorus  from  10*^  to  10^®  atoms/cm^.  By  comparison,  conventional 
platinum wires only have a TCR of around 0.1 %/°C. We can use this large TCR variation to
design a hot wire with a specific thermal sensitivity. Another advantage of the
polysilicon wire is that a hot wire's time constant is inversely proportional to its resistivity
(Blackwelder 1981). The resistance of conventional hot wires, which use metals exclusively, is
typically low (5-30 Ω). On the other hand, the resistance of polysilicon wires can be adjusted
over a wide range (from kΩ to several hundred kΩ). When a constant temperature mode is used, a
bandwidth of 1.4 MHz has been achieved with the aid of a heavily doped polysilicon wire. The
extremely high-frequency response of a micro hot wire is mainly achieved by adjusting the
material properties rather than by simply reducing its size.
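
The practical meaning of the TCR range can be seen by comparing the fractional resistance change per degree for polysilicon and platinum. The sketch below uses a linearized model, R = R_ref (1 + TCR x dT), which is an assumption that only holds for small temperature changes; the function names and the reference resistance are hypothetical.

```python
def resistance_at_temperature(r_ref, tcr_per_degc, delta_t):
    """Linearized resistance (ohm) after a temperature change delta_t (deg C).

    tcr_per_degc is a fraction per degree, so 0.1 %/degC is written 0.001.
    """
    resistance = r_ref * (1.0 + tcr_per_degc * delta_t)
    if resistance <= 0.0:
        raise ValueError("linear TCR model is not valid for this temperature change")
    return resistance


def relative_signal(tcr_per_degc, delta_t):
    """Fractional resistance change |dR/R| for a temperature change delta_t."""
    return abs(tcr_per_degc * delta_t)


if __name__ == "__main__":
    r_ref = 1000.0  # hypothetical reference resistance, ohm
    for name, tcr in (("polysilicon (-2 %/degC)", -0.02),
                      ("platinum (0.1 %/degC)", 0.001)):
        r_hot = resistance_at_temperature(r_ref, tcr, 5.0)
        print(f"{name}: R after +5 degC = {r_hot:.1f} ohm, "
              f"|dR/R| per degC = {relative_signal(tcr, 1.0):.3f}")
```

At the extreme of the quoted range, the polysilicon wire gives a fractional resistance change about twenty times larger than platinum for the same temperature fluctuation.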


Figure 12 Micro hot wire: (a) a 1-µm x 70-µm single hot wire and (b) a hot-wire array
(Jiang et al 1994).

5.1.2 SHEAR STRESS SENSOR A thermal surface shear-stress sensor is a hot film placed on
a surface (Haritonidis 1989). This sensor has reasonable sensitivity when it is used in water.
When air is the working medium, most of the heat is conducted into the substrate rather than into
the air, because of the very low heat capacity of air. The use of a surface-micromachined
shear-stress sensor (Jiang et al 1996, Huang et al 1995, 1996) has resolved the problem. A polysilicon
wire is placed on a 1.2-µm-thick, 200 x 200 µm² silicon-nitride diaphragm. Underneath the
silicon-nitride diaphragm is a 2-µm-deep vacuum cavity (Fig. 13a). Significant thermal insulation
has been achieved by using the vacuum chamber and the silicon-nitride diaphragm with low
thermal conductivity. Figure 14 shows that the resistance of the polysilicon wire on a
vacuum-insulated diaphragm is almost an order of magnitude higher than that on a solid substrate. The
micro-surface shear-stress sensor has a sensitivity of 100 mV/Pa and a bandwidth of 10 kHz and
higher. This type of micro sensor can also be integrated into a large array and can even form a
flexible skin to be placed on a curved surface (Jiang et al 1997). Mechanically, the flexible skin
(Fig. 13b) consists of 128 separated silicon islands (1 mm x 1 mm x 80 µm) that are connected
together by a thin polyimide film (1-20 µm thick). The shear stress sensors are built on the
silicon islands. This skin has been taped on a semi-cylindrical (1.3-cm-diameter) delta-wing
leading edge to perform real-time 2-D shear-stress mapping. Flow separation along the leading
edge has then been successfully measured using the skins.


Figure 13 Micro-surface shear-stress sensor: (a) a single sensor (Jiang et al 1994) and
(b) a flexible array of 128 sensors (Jiang et al 1997).

Figure 14 Heating element temperature of the shear stress sensor (Jiang et al 1994).
(O - on vacuum chamber, □ - on air-filled chamber, A - on solid substrate)


A different design of a thermal-based wall shear-stress micro sensor has been reported by Lofdahl
et al (1996). In this case, the sensor is a 300 x 60 x 30 µm³ silicon chip that utilizes a
temperature-sensitive silicon diode as the temperature sensing element. This chip is embedded in
a silicon diaphragm with polyimide-filled trenches to improve thermal isolation. This device takes
about 40 mW of power in order to obtain a biased differential temperature of 100°C. The
uniqueness of this work is that a polysilicon-diaphragm piezoresistive pressure sensor is also
integrated with the shear stress sensor.

By suspending a micro floating element, a direct measure of the shear stress can be achieved.
Schmidt et al (1988) first reported a micro floating-element shear-stress sensor. It was designed
to provide a spatial resolution of 100 µm and a bandwidth of 20 kHz. The floating element was a
500 x 500 x 30 µm³ polyimide plate tethered by polyimide beams. When exposed to shear stress,
the plate movement is sensed capacitively with two integrated FET transistors and off-chip
electronics. A sensitivity of 52 µV/Pa was reported. Pan et al (1995) then reported a different
floating element. Pan's device is made of thin-film polysilicon. A typical device includes a
polysilicon floating element (~100 x 100 x 2.2 µm³) and several suspension beams (~100 µm
long). These floating elements have much smaller sizes and better sensitivities (167 mV/Pa),
owing to delicate force-balance electronics. Padmanabhan et al (1995) reported a
single-crystal-silicon, bulk-micromachined and silicon fusion-bonded sensor. The movement of the
element is measured by two photo diodes underneath the element.

5.1.3 PRESSURE SENSOR Micro piezoresistive pressure sensors represent the most mature
and successful application of MEMS devices, having a wide range of applications from
automobiles to biomedical instruments. Micromachined piezoresistive silicon-based pressure
sensors were first introduced in 1958 by Kulite, Honeywell, and MicroSystems (Brysek et al
1990). At that time, the devices were typically made with silicon piezoresistors glued to metal
diaphragms. The much more advanced micro pressure sensors used today are made by
anisotropic etching of silicon, which requires no hand assembly. Some examples are the fully
integrated Motorola pressure sensor (Fraden 1993) and the silicon fusion-bonded millimeter- and
sub-millimeter-size pressure sensors (Brysek et al 1990). Polysilicon has been a popular material
for the piezoresistors, but many diaphragm materials have been explored, such as polysilicon
(Guckel et al 1986, Sugiyama et al 1991) and silicon nitride (Liu et al 1995). Piezoresistive
pressure sensors are available in a wide range of pressure ratings (up to 10,000 psi) and with typical
uncompensated full-scale pressure sensitivity of about 20 mV/V. Capacitive pressure sensors
have the advantages of better sensitivity and less temperature drift (Lee 1982, Ko 1982);
however, their performance depends heavily on electronics.

5.1.4  TEMPERATURE  SENSOR  Many  different  types  of  temperature  sensors  have  long  been 
available  (Fraden  1993),  and  most  of  these  temperature-sensing  schemes  can  be  utilized  in  micro 
scales.  For  example,  the  silicon-resistive  and  pn-junction  temperature  sensors  are  widely  used  for 
temperature  compensation  in  silicon  diaphragm-type  pressure  sensors  (Brysek  et  al  1990).  An 
interesting  example  (Lahiji  and  Wise  1980)  is  an  IR  detector  that  uses  a  thin  diaphragm  to  absorb 
incoming IR light, converting the photon energy into heat. Sixty thin-film thermocouples (Bi-Sb
and Au-polysilicon) were fabricated in series on a 2 × 2-mm², 1-µm-thick diaphragm. The device
shows an impressive sensitivity of 30 V/W and a time constant below 10 ms.


5.2  Micro  Flow  Actuator 

Several actuating mechanisms, including electrostatic, electromagnetic, and thermopneumatic actuation,
are used in micro actuators. Among them, electrostatic actuation is the most popular, mainly
because of its ease of fabrication. However, electrostatic actuation has intrinsic limitations on
force (~ µN) and displacement (~ µm) outputs, so electrostatic actuators are best suited to flow instability
problems where only small displacements are required. For example, Huang et al (1996) made an
electrostatic actuator to control screech in high-speed jets. The device was designed for a 70-µm peak-to-peak
displacement at its resonant frequency of 5 kHz, and it has been shown that
proper disturbances in the jet flow can be provided.

Electromagnetic actuation can provide much larger forces (100s of µN) and larger displacements
(~ mm). For example, a micro magnetic flap (Miller et al 1996) with a 30-turn copper coil and a
layer of permalloy (Ni80Fe20) was fabricated on a 4 × 4-mm² silicon plate (40 µm thick). In a
magnetic field of 1 kG, the device can be driven to a rotational angle of more than 30° because of
the magnetization induced in the permalloy. In addition, a current of ±100 mA can be
applied to the coil to move the flap another ±10°. This allows total control of the flap over more
than 2 mm of tip motion (Fig. 15). These and other similar devices have been shown to be
effective in changing the separation of a flow around a leading edge (Liu et al 1995) and in
achieving drag reduction in boundary layer flow (Tsao et al 1994, Ho et al 1997).
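As a rough consistency check on the flap figures quoted above, the tip height of a rigid hinged flap can be estimated from its length and rotation angle. The sketch below assumes that the 4-mm plate rotates rigidly about one edge, which is a simplification not stated in the text, and the function name is illustrative only.

```python
import math


def tip_displacement_mm(flap_length_mm: float, angle_deg: float) -> float:
    """Out-of-plane tip height of a rigid flap hinged along one edge."""
    return flap_length_mm * math.sin(math.radians(angle_deg))


# Assumed geometry: the 4-mm plate rotates rigidly about one edge.
FLAP_LENGTH_MM = 4.0

# 30 degrees is the bias rotation in the 1-kG field; the coil adds +/-10 degrees.
bias = tip_displacement_mm(FLAP_LENGTH_MM, 30.0)
low = tip_displacement_mm(FLAP_LENGTH_MM, 30.0 - 10.0)
high = tip_displacement_mm(FLAP_LENGTH_MM, 30.0 + 10.0)

print(f"bias: {bias:.2f} mm, coil range: {low:.2f} to {high:.2f} mm")
```

Under this assumption, the 30° bias rotation alone corresponds to about 2 mm of tip height, which is in line with the tip motion quoted for the device.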

Thermopneumatic actuation can provide forces (> N) and
displacements (> mm) even larger than those of the electromagnetic method. Thermopneumatic
actuation, first demonstrated by Zdeblick and Angell (1987), relies on a heater in a liquid-filled
cavity that heats the fluid and causes a significant pressure rise inside the cavity. If a wall of the cavity is a
diaphragm, it will be pushed outward because of the differential pressure increase. Since the
pressure increase can easily be several psi, the force acting on the diaphragm is large. However,
the earlier demonstration used only flat silicon diaphragms, so the deflection was limited by the
mechanical structure. A recent demonstration that used silicone rubber diaphragms showed
more than 1 mm of displacement with about 10 psi of pressure (Yang et al 1997). It is foreseeable that


Figure  15  Micro  electro-magnetic  flap  actuator  (Miller  et  al  1996). 


6.  FLOW  CONTROL  BY  MICRO  SYSTEMS 

A  micro  system  that  contains  transducers  and/or  logic  circuitry  is  capable  of  sensing,  signal 
processing,  and  actuation.  Such  a  system  obviously  has  numerous  engineering  applications  (Gad- 
el-Hak  1993,  Ho  &  Tai  1996,  McMichael  1996,  Ho  et  al  1997).  A  few  examples  of  using  micro 
systems  for  flow  control  are  reviewed  here. 

6.1  Moments  Control  of  a  Delta  Wing 

On  a  delta  wing,  the  two  leading-edge  separation  vortices  contribute  up  to  40%  of  the  lift  at  high 
angles  of  attack.  If  we  can  break  the  symmetry  of  these  vortices,  a  net  rolling  torque  may  be 
achieved that can be used for maneuvering the wing. Leading-edge vortices develop from a
boundary layer separated from the surface. In free shear flows, it has been demonstrated that
minute perturbations at the origin of the flow can globally modify the flow field (Ho & Huang 1982, Oster &
Wygnanski 1982). Therefore, a coupling between micro actuators and wing-size flow structures
can be made by manipulating the thin boundary layer.

A delta wing with a 56.5° sweep angle was used in this experiment (Lee et al 1996). Electromagnetically
driven microactuators are placed along the leading edges, which have a circular cross
section. The location of the actuators on the leading edge is indicated by an angle, θ, which is
measured from the lower surface of the wing, θ = 0°, towards the upper surface, θ = 180°. The
measured rolling torques are normalized by the torque produced by the vortex lift, Mv, which is
the product of the lift generated by a single leading-edge vortex and the distance between the
centroid of the half wing and the center chord. The magnitude of the maximum torque reaches
15% of Mv at θ ≈ 60° for small actuators (Fig. 16). When the actuators match the thickness of
the boundary layer, the maximum normalized torque can be about 40% (Ho et al 1997). A
flexible skin containing 128 sensors (Fig. 13b) is used to measure the surface shear stress. The
measurements show that the flow separates immediately downstream from θ = 60°. A negative
torque was found for actuators placed at θ = 80°. When actuators are placed on both leading
edges but at two angles (θ = 60°, 80°), the measured resultant torque is equal to the sum of the
two peaks. Therefore, the two vortices act independently on this delta wing with a sweep angle
of 56.5°. Interactions of the two leading-edge vortices are expected for wings with a larger sweep
angle (Lee & Ho 1990). Pitching and yawing torques can also be obtained by using the miniature
actuators (Ho et al 1997).


Figure 16 Normalized rolling moment as a function of actuator location at an angle of attack of 25°.

6.2  Thrust  Vectoring 

Deflecting  the  momentum  of  a  jet  to  a  direction  other  than  the  one  normal  to  the  nozzle  plane  is 
useful in applications such as fluidic switching and aircraft maneuvering. Coe et al (1994) used a
micromachined vibrating diaphragm driven by electrostatic force under a cavity to generate a
miniature jet. This jet is established by asymmetric entrainment during the push-pull cycle. In the
pushing phase, the fluid near the jet axis is moved away from the diaphragm. When the
diaphragm is pulled back, fluid all around the cavity flows toward the diaphragm, and a net
concentrated jet flow is established. Acousticians refer to this phenomenon as a quartz wind
(Kuttruff 1991), because quartz has been used as the diaphragm material in acoustic transducers.
When the micro jets are placed at the nozzle of a rectangular jet with a 7.62-cm × 1.27-cm nozzle,
the direction of the large jet can be changed according to the amplitude of the micro jets (Fig. 17).
This capability provides a simple thrust-vectoring technique (Smith & Glezer 1997), which is
much more advantageous than the present method of using paddles to deflect the jet.


Figure  17  Thrust  vectoring  by  a  micro  jet  (Smith  &  Glezer  1997). 


6.3  Control  of  Turbulent  Skin  Friction 

The  discovery  of  organized  structures  (Brown  &  Roshko  1974)  has  significantly  facilitated 
turbulence  control  of  free  shear  flows  (Ho  &  Huang  1982,  Oster  &  Wygnanski  1982).  However, 
attempts  to  control  wall  flows  have  been  much  less  successful.  The  dynamics  of  the  large-scale 
structures  in  free  shear  flows  are  relatively  well  understood  (Ho  &  Huerre  1984,  Huerre  & 
Monkewitz 1990) through the instability process. The underlying mechanisms by which the
organized structures are produced and sustained in wall flows are less well known (Robinson
1991). The following are problems for experimental studies:

1. The transverse length scale of the streaks is on the order of 100s of microns.

2. The lifetime of the streaks is about one ms or less.

3. The streaks spread over the entire surface area covered by the turbulent boundary layer.

The small size is a direct cause of great difficulties, because traditional transducers are larger than
the structures. These relatively large transducers do not have enough spatial resolution to measure
the structure's properties or to explore the structure's response to perturbations. We therefore cannot
devise control schemes analogous to those we have used in the past on free shear layers.
Micromachined transducers match the size of the streaks, and integrated logic circuitry
enables real-time decisions. A micro system thus provides an opportunity to directly manipulate
individual wall structures. This is the reason that wall-bounded turbulent-flow control is chosen
as a testbed for developing a micro system that comprises sensors, logic circuitry, and actuators.

6.3.1 DISTRIBUTED MEASUREMENT A surface shear-stress imaging chip that comprises
85 micro sensors was developed to measure the instantaneous turbulent shear stress in a
distributed manner. The size of the sensors and the spacing between them are chosen to match the
length scale of near-wall stream-wise streaks in the tested Reynolds number range. Figure 18 shows the
instantaneous surface shear-stress contours measured by the sensor array. Only 25 sensors in a
linear span-wise array are used. The vertical direction is a pseudo-stream-wise coordinate generated by
multiplying time by the convection speed. The capability of measuring the surface shear stress is
not only essential for the interactive control of the high-speed streaks but also provides interesting
statistical properties of the streaks (Motoaki et al 1997).
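The pseudo-stream-wise coordinate described above is the usual frozen-turbulence conversion, x = Uc t, applied to the time record of a span-wise sensor row. The following minimal sketch shows the mapping; the sample rate and convection speed are hypothetical placeholders, not values from the experiment.

```python
def pseudo_streamwise_mm(sample_times_s, convection_speed_m_s):
    """Map sample times to pseudo-stream-wise positions x = U_c * t, in mm."""
    return [convection_speed_m_s * t * 1e3 for t in sample_times_s]


# Hypothetical values, chosen only to make the example concrete.
SAMPLE_RATE_HZ = 10_000.0
CONVECTION_SPEED_M_S = 5.0

times = [n / SAMPLE_RATE_HZ for n in range(5)]
positions = pseudo_streamwise_mm(times, CONVECTION_SPEED_M_S)
print(positions)
```

Each row of a contour plot such as Figure 18 would then correspond to one sample time, placed at the position returned by this mapping.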


Figure  18  Instantaneous  surface  shear  stress  measured  by  an  imaging  chip  (Ho  et  al  1997). 

6.3.2  INTERACTION  BETWEEN  THE  ACTUATOR  AND  WALL  STRUCTURES 
Being  able  to  perform  distributed  sensing  is  a  major  step  forward.  Understanding  how  the 
actuator  interacts  with  the  near  wall  structure  is  an  even  more  important  part  of  the  control 
process. Actuators are wall-mounted devices that can produce surface deformation and/or flow
disturbances. The flow altered by a rising obstacle was studied by direct numerical simulation
(Carlson & Lumley 1996). Vortical structures are produced by the unsteady motion of the
actuator and are deformed while the actuator moves through the boundary-layer velocity shear.
These actuator-produced perturbations are expected to alter the near-wall structure in such a way
that a skin friction reduction is achieved. Realizing this concept depends on finding
proper actuation commands, which in turn requires an in-depth understanding of the non-linear
interaction between the artificial disturbance and the high-speed streaks. However, almost all of
the current experiments investigate the effect of an actuator on stationary vortices embedded in a
laminar flow (Jacobson & Reynolds 1995, Ho & Tai 1996), because studies of the effect on the
streaks in turbulent flow require a complete micro system (see Section 6.3.3), which is still under
development.

At the entrance region of a 2-D channel flow, the boundary layer is laminar. A 1.3-mm-thick
bump was placed normal to the flow, terminating halfway across in the span-wise direction (Ho & Tai
1996). At the terminating position of the bump, a stationary longitudinal-vortex pair is produced.
A micromachined actuator (Fig. 15), which is a silicon nitride flap driven by an electromagnetic
field, is placed downstream of the vortex generator. During the upward moving phase, the
ensemble-averaged vertical velocity field (Fig. 19) shows the transport of high-velocity fluid away
from the surface. It is interesting to note that the downward motion of the actuator does not
produce a symmetric cancellation effect. Because of the presence of the wall, the low-speed fluid
under the flap is pushed upward during the downward motion of the flap, so that a net reduction in
the skin drag is achieved through a complete cycle of the flap motion. The surface shear stress
was integrated along the span-wise direction and over the whole cycle of the motion of the
actuator. The integrated shear stress normalized to the dynamic head is defined as the skin drag
coefficient (Cdv). If a proper driving signal is applied to the actuator, the skin drag coefficient of
the vortex-actuator interaction (Cd) is lower than the drag coefficient of the stationary vortex
alone. The net reduction of the surface shear stress is the difference between these two
coefficients. The net reductions are plotted for various actuator frequencies (ω) and maximum tip
heights (d) in Figure 20. The product of ω and d is the maximum normal velocity of the actuator.
For constant ωd, the net reductions in the skin friction drag are the same for different
combinations of ω and d. This result indicates that the amount of shear stress reduction is directly
related to the ability to transport high-speed fluid. In other words, we need to seek actuators that
can efficiently pump high-speed fluid away from the wall with low power expenditure and low
form drag.
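The statement that only the product of frequency and tip height matters can be made concrete by listing combinations that share the same peak normal velocity. The sketch below uses the two products quoted for Figure 20 (80 and 100 mm/s); the frequencies are hypothetical, and treating the frequency as an angular frequency in rad/s is an assumption.

```python
def tip_height_mm(omega_d_mm_s: float, omega_rad_s: float) -> float:
    """Tip height d that gives a prescribed peak normal velocity omega * d."""
    return omega_d_mm_s / omega_rad_s


# The products are the values quoted for Figure 20; the frequencies are made up.
for omega_d in (80.0, 100.0):
    for omega in (50.0, 100.0, 200.0):
        d = tip_height_mm(omega_d, omega)
        print(f"omega*d = {omega_d:5.1f} mm/s, omega = {omega:5.1f} rad/s, d = {d:.2f} mm")
```

According to the result above, every row that shares the same product would be expected to give the same net skin-friction reduction, so a slow, tall stroke and a fast, short one are equivalent in this respect.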


Figure 19 Vertical velocity contours of a flap actuator interacting with a longitudinal vortex pair (panels at phase angles 216° and 288°).
Phase angle: 0° and 360°, flap on the surface; 180°, flap at its uppermost location.

Figure 20 Surface shear-stress reduction as a function of actuator frequency, ω, and maximum
actuator tip height, d (• ωd = 80 mm/s; ■ ωd = 100 mm/s) (Ho et al 1997).

In a water channel, the interaction between a piezoelectrically driven actuator and a vortex pair
developed from a cylinder was studied (Jacobson & Reynolds 1995). Surface-mounted hot-film
sensors were used to measure the surface shear stress, which serves as an input for feedback
control. The optimum amplitude of the actuator obtained from the feedback control can delay the
laminar-turbulent transition by about 40 displacement thicknesses from the natural transition
region (Fig. 21).


Figure 21 Transition delay, u′/u₀, by actuation, where δ* is the displacement thickness
(Jacobson and Reynolds 1995).

6.3.3 A MICRO SYSTEM FOR SURFACE SHEAR-STRESS REDUCTION The lifetime
of the high-speed streaks is short, and a large number of the streaks need to be controlled at the
same time. If all of the sensor outputs were sent to a central computer and the control
commands were sent from the computer to each actuator, a very high-bandwidth signal path
and a large number of leads would be required. Implementing local and simple signal processing is
therefore a necessity for alleviating these problems. Neural networks are a viable approach
(Jacobson & Reynolds 1995, Lee et al 1997). A neural-network-based circuit was developed to
determine the time and the spatial extent of the high-speed streaks passing the shear-stress-sensor
array (Gupta et al 1996). The output of the circuit was used for actuation. Integrating
a micro system containing micro sensors, microelectronics, and micro actuators on a chip is
possible, yet it is not trivial. The micro transducers, which are integrated with the circuits, need to be
fabricated at a much higher temperature than that of the circuits, which means that the diffusion of
the doping material in the circuits may affect their performance. The process flow needs to be
carefully planned to eliminate cross-contamination possibilities. A micro system for shear stress
reduction was fabricated on a 1-cm × 1-cm die; 18 micro shear-stress sensors, 3 micro flap
actuators, and circuits for logic, sensor drivers, and actuator drivers are monolithically
integrated (Fig. 22). This is a prototype design, and various tests of the functionalities are still
underway.
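The wiring argument above can be illustrated with a back-of-envelope estimate of the raw data rate that a centralized scheme would have to carry. Only the sensor count per die and the streak lifetime of about 1 ms come from the text; the sample rate, word length, and tiling of a square metre with identical dies are assumptions made purely for illustration.

```python
def aggregate_rate_bits_s(n_sensors: int, sample_rate_hz: float, bits_per_sample: int) -> float:
    """Raw data rate if every sensor reading is shipped to one central computer."""
    return n_sensors * sample_rate_hz * bits_per_sample


# 18 sensors per 1-cm x 1-cm die is from the text; everything else is assumed.
SENSORS_PER_DIE = 18
SAMPLE_RATE_HZ = 10_000.0   # assumed: about 10 samples per 1-ms streak lifetime
BITS_PER_SAMPLE = 12        # assumed converter resolution
DIES_PER_M2 = 100 * 100     # 1-cm dies tiling one square metre

per_die = aggregate_rate_bits_s(SENSORS_PER_DIE, SAMPLE_RATE_HZ, BITS_PER_SAMPLE)
per_m2 = per_die * DIES_PER_M2
print(f"{per_die / 1e6:.2f} Mbit/s per die, {per_m2 / 1e9:.1f} Gbit/s per square metre")
```

Even with these modest assumed numbers, the sensor traffic alone grows to tens of Gbit/s per square metre of controlled surface, before any actuator commands are counted, which is the motivation for processing signals locally on each die.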


Figure 22 A micro system for surface shear-stress reduction (Ho et al 1997). Labeled components: actuator driver, neural-network-based signal processing unit, actuator, and shear stress sensor.

7.  CONCLUDING  REMARKS 

The  capability  of  applying  a  large  number  of  transducers  with  on-board  signal  processing  provides 
a  paradigm  shift  for  flow  control.  This  type  of  micro  system  enables  us  to  perform  real-time 
distributed  control.  Hence,  the  ability  to  interactively  manipulate  individual  high-speed  streaks  in 
the  turbulent  boundary  layer  is  made  possible  and  could  be  effective  in  decreasing  the  surface 
friction force. A near-term industrial application for the skin-friction-drag experiment is not
obvious. This approach, however, will definitely facilitate our exploration of the near-wall
structure under artificial perturbations, and the findings will lead us toward a fundamental
understanding of the grand challenge in turbulence research. Furthermore, other types of flow
control may also be inspired by the real-time, distributed control ability of MEMS-based
transducers. The large torque on a delta wing and the thrust vectoring achieved by micro
actuation are examples. We expect that a stream of new flow phenomena will be introduced by
MEMS-based sensing and control schemes.

In  the  field  of  biomedical  analysis,  MEMS  technology  has  already  sparked  a  major  transformation. 
Flows  through  micro  channels  or  nozzles  are  the  most  common  configurations  in  all  of  the 
biomedical  applications.  Many  unexpected  phenomena  have  been  observed  in  Newtonian  fluid 
flows  through  these  simple  configurations.  These  types  of  flows  are  extremely  intriguing,  because 
they challenge the fundamentals of fluid dynamics; the no-slip boundary condition clearly should
be relaxed for gas flows. In liquid flows, extra surface-force terms may need to be added to the
momentum balance equation. Whether any modification needs to be made to the constitutive
equation for a simple fluid remains to be investigated. It is evident that micro flow has opened a
new domain for basic fluid-mechanics research.

Abstract

In  this  paper,  we  consider  linear  stabilization  of  plane,  Poiseuille  flow  using  linear 
quadratic  Gaussian  (LQG)  optimal  control  theory.  In  prior  investigations,  it  has  been 
shown that plane, Poiseuille flow may be stabilized using extremely simple, integral
compensator controllers, given that sensor locations produce minimum phase zeros.
However, the controller is marked by slow dissipation of perturbation energy. In this
investigation, we exploit LQG methods to give faster perturbation energy dissipation.
We show that we may achieve approximately double the dissipation rate
reported for integral compensator control schemes. This increase in performance comes
at a cost. Controllers built using LQG methods for infinite-dimensional systems
must, in theory, be infinite-dimensional. Since it is impossible to implement infinite-dimensional
controllers, we implement reduced-order controllers on the full-order plant.
We show that this procedure can in theory lead to de-stabilization of unmodelled dynamics.
We then show that this may be avoided using distributed control or, dually,
distributed sensing. An additional problem in high-plant-order LQG controller design
is numerical instability in the synthesis equations. We show a sub-optimal LQG
synthesis method that uses an extremely low-order plant model for synthesis of a high-order
controller. This sub-optimal controller produces results essentially equivalent to
the optimal controller. Finally, we apply our LQG controller to a plant of differing
Reynolds number from that used in the LQG design. Using this new plant, we show
that the system is still stabilized. However, the goal of double dissipation rate is not
achieved, showing the inherent non-robustness (in terms of degree of stabilization) of
this controller.

* Part of this material was first presented at the Conference on Decision and Control, New Orleans, LA,
December 1995 [1].

† Current address: Guidance and Control Analysis Group, Jet Propulsion Laboratory, California Institute
of Technology, Pasadena, CA 91109.


1  Introduction 

The  basic  concepts  of  feedback  control  of  plane,  Poiseuille  flow  were  laid  out  in  [2].  In  [2],  it 
was shown that the governing Navier-Stokes equations can be converted to control-theoretic
transfer  function  and  state  space  models  using  a  numerical  discretization  method.  Using 
the  transfer  function  models,  it  was  shown  that  plane  Poiseuille  flow  (channel  flow)  can  be 
stabilized  using  a  simple,  constant  gain  feedback,  integral  compensator  controller.  The  goal 
of  the  control  design  in  [2]  was  stability.  By  choosing  proper  sensor  locations,  we  were  able 
to  achieve  a  stable,  closed  loop  system  that  was  extremely  robust  to  changing  Reynolds 
numbers.  The  goal  of  the  present  paper  is  to  design  an  optimal  controller  by  moving  away 
from  classical  transfer  function  control  design  to  modern,  state  space  methods.  State  space 
control  methods  have  several  advantages.  State  space  methods  allow  straightforward  design 
of  controllers  for  multiple-input/multiple-output  systems.  Furthermore,  state  space  and 
optimal  control  methods  allow  far  greater  control  of  the  position  of  closed  loop  eigenvalues. 
In  the  transfer  function  design  used  thus  far,  the  system  was  stabilized,  but  the  stable  system 
still  had  closed  loop  eigenvalues  very  near  the  imaginary  s  axis.  This  translated  to  very  slow 
dissipation  of  perturbation  energy.  The  goal  of  the  present  paper  is  to  design  an  optimal, 
stabilizing  controller  that  constrains  the  controllable  and  observable  closed  loop  poles  to  be 
farther to the left of the imaginary s axis, thus leading to faster dissipation of perturbation
energy.
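To illustrate what moving closed-loop poles farther to the left means, the following minimal sketch applies one textbook technique, an exponentially weighted quadratic cost (often called an alpha-shift), to a single scalar mode. It is not the controller designed in this paper: the plant numbers are hypothetical, only state feedback (the regulator half of LQG, without an estimator) is shown, and whether the design in Section 3 uses this particular technique is an assumption.

```python
import math


def lqr_gain_with_margin(a: float, b: float, q: float, r: float, alpha: float) -> float:
    """Scalar LQR gain for x' = a*x + b*u with the cost weighted by exp(2*alpha*t).

    This is equivalent to solving the ordinary scalar Riccati equation for
    the shifted plant a + alpha:  2*(a + alpha)*p - (b*p)**2 / r + q = 0.
    """
    a_shift = a + alpha
    # Positive root of the scalar Riccati equation.
    p = r * (a_shift + math.sqrt(a_shift**2 + q * b**2 / r)) / b**2
    return b * p / r  # feedback law u = -k * x


# Hypothetical, slightly unstable scalar mode (illustrative numbers only).
a, b, q, r = 0.01, 1.0, 1.0, 1.0
for alpha in (0.0, 0.5):
    k = lqr_gain_with_margin(a, b, q, r, alpha)
    print(f"alpha = {alpha}: closed-loop pole = {a - b * k:.4f}")
```

In this scalar case the closed-loop pole is guaranteed to lie to the left of -alpha, so increasing alpha buys faster decay at the price of a larger feedback gain.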

The increased flexibility of state space methods does not come without cost. Unlike the
simple integral feedback control of [2], optimal controllers are complicated systems in themselves.
This adds considerably to the complexity of the overall closed loop system. In fact,
many beneficial qualities have been proven only under the assumption that the dimension of the controller is the
same as that of the plant. In the flow case, this brings a special problem since the plant is
of infinite dimension. Theoretically, the controller must also be of infinite dimension. This
is impractical for two reasons. First, it is impossible to physically implement an infinite-dimensional
controller. Second, the use of even very high-order finite-dimensional plants
leads to numerical problems in the optimal control synthesis equations. As a result, we will
design an optimal controller using a reduced-order model of the infinite-dimensional plant
and then apply the resulting controller to a full-order plant. However, applying reduced-order
controllers to full-order plants carries the risk of making unmodelled, stable parts of the
plant unstable. Therefore, controllers must be designed to ensure this does not happen. Furthermore,
LQG controllers have been shown to be very non-robust [3] in many applications.
We will provide one example investigating robustness in the channel flow case.

This paper is organized as follows. In section 2, the linear channel flow problem, state-variable
control models, and the single-wavenumber flow model are reviewed. This section
is essentially a review of the material in [2]. In section 3, linear quadratic Gaussian
(LQG) controller design is introduced, and ways in which closed loop eigenvalues can be made
stable to a prescribed degree are shown. Section 4 explains how the unavoidable unmodelled
dynamics  of  any  reduced  order  model  of  an  infinite-dimensional  plant  can  lead  to  closed 
loop  instability  in  LQG  design.  Distributed  actuation  and  distributed  sensing  are  shown 
to  be  dual  solutions  to  the  stability  problem.  Section  5  demonstrates  the  performance  of 
high-order  optimal  controllers.  Section  6  presents  the  concept  of  a  sub-optimal,  extremely 
low-order  controller  design  that  achieves  comparable  performance  to  the  high-order  optimal 
controller  design.  Finally,  the  synthesized  controller  is  used  on  a  plant  of  different  Reynolds 
numbers  to  investigate  robustness. 


2  The  Linear  Channel  Flow  Control  Problem 

2.1  Dynamic  Equations 


We consider the same plant as in [2], i.e. two-dimensional, plane, Poiseuille flow between
two parallel, stationary plates. Let the channel be of finite length and finite height, with the
centerline at zero. The flow in the channel is described by the unsteady, non-linear, incompressible
Navier-Stokes equations. In order to study the linear stability of the system, we
follow the standard procedure. Consider small perturbations in the velocities of $u(x,y,t)$ in
the horizontal direction, $v(x,y,t)$ in the vertical direction, and $p(x,y,t)$ in the pressure field.
Let the primary flow be represented by $U(y)$, with $U_c$ being the centerline velocity. The linearized,
incompressible Navier-Stokes equations may be formed by substituting the primary
flow and small perturbations into the non-linear, incompressible Navier-Stokes equations and
disregarding the second-order terms involving the perturbations.


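$$\frac{\partial u(x,y,t)}{\partial t} + U(y)\frac{\partial u(x,y,t)}{\partial x} + \frac{dU(y)}{dy}v(x,y,t) = -\frac{\partial p(x,y,t)}{\partial x} + \frac{1}{Re}\nabla^2 u(x,y,t) \tag{1}$$

$$\frac{\partial v(x,y,t)}{\partial t} + U(y)\frac{\partial v(x,y,t)}{\partial x} = -\frac{\partial p(x,y,t)}{\partial y} + \frac{1}{Re}\nabla^2 v(x,y,t) \tag{2}$$

$$\frac{\partial u(x,y,t)}{\partial x} + \frac{\partial v(x,y,t)}{\partial y} = 0 \tag{3}$$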

where the flow variables are non-dimensionalized by the channel half-height, $H$, and centerline
velocity, $U_c$. $Re$ is the Reynolds number, defined as $U_c H/\nu$, where $\nu$ is the kinematic
viscosity. By introducing a "stream function", $\psi(x,y,t)$,

$$u(x,y,t) \triangleq \frac{\partial \psi(x,y,t)}{\partial y} \tag{4}$$

and

$$v(x,y,t) \triangleq -\frac{\partial \psi(x,y,t)}{\partial x} \tag{5}$$

(1-3) may be combined into a single equation,


$$\frac{\partial}{\partial t}\frac{\partial^2 \psi}{\partial x^2} + \frac{\partial}{\partial t}\frac{\partial^2 \psi}{\partial y^2} = -U(y)\frac{\partial^3 \psi}{\partial x^3} - U(y)\frac{\partial^3 \psi}{\partial x\,\partial y^2} + \frac{d^2 U(y)}{dy^2}\frac{\partial \psi}{\partial x} + \frac{1}{Re}\nabla^2\left(\nabla^2 \psi\right) \tag{6}$$

Assume periodic boundary conditions in the streamwise ($x$) direction. For channel flow with
rigid plates at $y = -1$ and $y = 1$, the no-slip boundary conditions become

$$\psi(x, y=-1, t) = 0 \tag{7}$$

$$\frac{\partial \psi}{\partial y}(x, y=-1, t) = 0 \tag{8}$$

$$\psi(x, y=1, t) = 0 \tag{9}$$

$$\frac{\partial \psi}{\partial y}(x, y=1, t) = 0 \tag{10}$$

With an initial condition,

$$\psi(x, y, t=0) = g(x,y) \tag{11}$$

the boundary value problem is completely formed. Equations (6-11) represent the starting
point for construction of a feedback control system. These equations neither include any
control terms nor describe any sensing of flow field variables.


2.2  Boundary  Input 

We consider the case of blowing/suction at the lower wall of the channel. The boundary
conditions are now modified from before to include boundary input, represented as the known
separable function $q(t)l(x)f(y)$,

$$\psi(x, y=-1, t) = q(t)\,l(x)\,f(y=-1) \tag{12}$$

$$\frac{\partial \psi}{\partial y}(x, y=-1, t) = q(t)\,l(x)\,\frac{\partial f(y=-1)}{\partial y} = 0 \tag{13}$$

$$\psi(x, y=1, t) = 0 \tag{14}$$

$$\frac{\partial \psi}{\partial y}(x, y=1, t) = 0 \tag{15}$$

Note that these conditions constrain the function $f(y)$ such that

$$f(y=-1) \neq 0 \tag{16}$$

$$\frac{\partial f(y=-1)}{\partial y} = 0 \tag{17}$$

$$f(y=1) = 0 \tag{18}$$

$$\frac{\partial f(y=1)}{\partial y} = 0 \tag{19}$$

Many functions may be appropriate. One such function is

$$f(y) = \tfrac{1}{2}y^4 + \tfrac{1}{4}y^3 - y^2 - \tfrac{3}{4}y + 1 \tag{20}$$
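The constraints (16)-(19) on $f(y)$ are easy to check numerically. The sketch below assumes the quartic $f(y) = \tfrac{1}{2}y^4 + \tfrac{1}{4}y^3 - y^2 - \tfrac{3}{4}y + 1$ as the reading of (20); the function names `f` and `df_dy` are chosen here for illustration only.

```python
def f(y):
    """Quartic wall-normal shape function, assumed reading of (20)."""
    return 0.5 * y**4 + 0.25 * y**3 - y**2 - 0.75 * y + 1.0


def df_dy(y):
    """Analytic derivative of f with respect to y."""
    return 2.0 * y**3 + 0.75 * y**2 - 2.0 * y - 0.75


# Constraints (16)-(19): f(-1) != 0, f'(-1) = 0, f(1) = 0, f'(1) = 0.
checks = {
    "f(-1)": f(-1.0),
    "f'(-1)": df_dy(-1.0),
    "f(1)": f(1.0),
    "f'(1)": df_dy(1.0),
}
for name, value in checks.items():
    print(f"{name} = {value:+.3f}")
```

Only the value at the lower wall is non-zero, so the blowing/suction input enters through $y = -1$ alone while the upper wall conditions stay homogeneous.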

In order to relate boundary conditions on $\psi$ to blowing/suction in the wall-normal direction,
we use (5) to relate $v(x,y,t)$ and $\psi(x,y,t)$. Then (12) becomes

$$v(x, y=-1, t) = -q(t)\,\frac{\partial l(x)}{\partial x}\,f(y=-1) \tag{21}$$

Note that $v(x,y,t)$ is related to the derivative of $l(x)$.

The homogeneous equation (6) and the inhomogeneous boundary condition (12) can
be converted into an inhomogeneous equation with homogeneous boundary conditions by
introducing

$$\phi(x, y, t) \triangleq \psi(x, y, t) - q(t)\,f(y)\,l(x) \tag{22}$$


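Then by substituting into (6), we obtain

$$
\begin{aligned}
\frac{\partial}{\partial t}\frac{\partial^2\phi}{\partial x^2} + \frac{\partial}{\partial t}\frac{\partial^2\phi}{\partial y^2} ={}& -U(y)\frac{\partial^3\phi}{\partial x^3} - U(y)\frac{\partial^3\phi}{\partial x\,\partial y^2} + \frac{d^2U(y)}{dy^2}\frac{\partial\phi}{\partial x} + \frac{1}{Re}\frac{\partial^4\phi}{\partial x^4} + \frac{2}{Re}\frac{\partial^4\phi}{\partial x^2\,\partial y^2} + \frac{1}{Re}\frac{\partial^4\phi}{\partial y^4}\\
& - \frac{dq(t)}{dt}\frac{d^2l(x)}{dx^2}f(y) - \frac{dq(t)}{dt}\,l(x)\frac{d^2f(y)}{dy^2} - q(t)\frac{d^3l(x)}{dx^3}U(y)f(y) - q(t)\frac{dl(x)}{dx}U(y)\frac{d^2f(y)}{dy^2} + q(t)\frac{dl(x)}{dx}\frac{d^2U(y)}{dy^2}f(y)\\
& + \frac{1}{Re}q(t)\frac{d^4l(x)}{dx^4}f(y) + \frac{2}{Re}q(t)\frac{d^2l(x)}{dx^2}\frac{d^2f(y)}{dy^2} + \frac{1}{Re}q(t)\,l(x)\frac{d^4f(y)}{dy^4}
\end{aligned}
\tag{23}
$$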


The boundary conditions in terms of $\phi$ are now

$$\phi(y=-1) = 0 \tag{24}$$

$$\frac{\partial \phi(y=-1)}{\partial y} = 0 \tag{25}$$

$$\phi(y=1) = 0 \tag{26}$$

$$\frac{\partial \phi(y=1)}{\partial y} = 0 \tag{27}$$

The first line of (23) is the original dynamical equation, (6), and the next two lines are all
known input terms.


2.3  Boundary  Output 


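We use the streamwise component of shear at a single boundary point, $z(x_i, y=-1, t)$, as
our boundary output, which is given by

$$z(x_i, y=-1, t) = \frac{\partial u(x_i, y=-1, t)}{\partial y} \tag{28}$$

By expressing $u(x_i, y=-1, t)$ in terms of the stream function (4),

$$z(x_i, y=-1, t) = \frac{\partial^2 \psi(x_i, y=-1, t)}{\partial y^2} \tag{29}$$

and by observing (22)

$$z(x_i, y=-1, t) = \frac{\partial^2 \psi(x_i, y=-1, t)}{\partial y^2} = \frac{\partial^2 \phi(x_i, y=-1, t)}{\partial y^2} + q(t)\,\frac{\partial^2 f(y=-1)}{\partial y^2}\,l(x_i) \tag{30}$$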


2.4  State  Variable  Control-Theoretic  Models 

Much of modern control theory is based on the state variable representation of a dynamic
system. This representation relies on the basic fact that the motion of any finite-dimensional,
dynamic system may be expressed as a set of first-order, ordinary-differential equations in
matrix form,

$$\frac{dx}{dt} = Ax + Bq \tag{31}$$

$$z = Cx \tag{32}$$

The vector $x$ is called the state vector of the system, $q$ is the input variable, and $z$ is the
output variable. The matrix $A$, and the vectors $B$ and $C$ are called the state space matrices
of the single-input, single-output (SISO) system. More specifically, the $A$ matrix is referred
to as the dynamic matrix of the system. It can be shown that the poles of the system are
simply the eigenvalues of the $A$ matrix. In the more general case of multiple-input, multiple-output
(MIMO) systems, $B$ and $C$ are matrices. For generality, we may add another term,
$Dq$, to the output equation to account for systems in which there is direct feedthrough from
the input to the output. Then,

$$z = Cx + Dq \tag{33}$$

In the case of no direct feedthrough, $D = 0$, the state space and transfer function models
are related as

$$H(s) = C(sI - A)^{-1}B = K\,\frac{\prod_{i=1}^{m}(s - \zeta_i)}{\prod_{i=1}^{l}(s - p_i)} \tag{34}$$

where $p_1 \ldots p_l$ are the poles of the system, $\zeta_1 \ldots \zeta_m$ are the zeros of the system, and $K$ is a scalar gain.
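The statement that the poles are the eigenvalues of $A$, and the relation between the state space and transfer function forms, can be illustrated on a toy system. The sketch below uses a hypothetical 2-state SISO model with made-up numbers (it is not the channel flow model) and assumes NumPy is available.

```python
import numpy as np

# Hypothetical 2-state SISO system (illustrative numbers, not the channel model).
A = np.array([[0.0, 1.0],
              [-2.0, -3.0]])
B = np.array([[0.0],
              [1.0]])
C = np.array([[1.0, 0.0]])


def transfer_function(s):
    """Evaluate H(s) = C (sI - A)^{-1} B at a complex frequency s."""
    n = A.shape[0]
    return (C @ np.linalg.solve(s * np.eye(n) - A, B))[0, 0]


# Poles of the system are the eigenvalues of A (real for this example).
poles = np.sort(np.linalg.eigvals(A).real)
print("poles:", poles)

# For this A, B, C the transfer function is 1 / (s^2 + 3 s + 2): no zeros, unit gain.
s_test = 1.0 + 2.0j
H_direct = transfer_function(s_test)
H_polezero = 1.0 / ((s_test - poles[0]) * (s_test - poles[1]))
print(H_direct, H_polezero)
```

Both evaluations agree, which is the content of the pole-zero form of the transfer function for this small case.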

It is important to note that the channel flow problem is defined by partial differential
equations and therefore, no finite-dimensional model will capture all its dynamics. Therefore,
any finite-dimensional controller synthesized from any finite-dimensional model may cause
unmodelled dynamics to become unstable. This issue will be taken up in section 4.


2.5  Single  Wavenumber  Model 

We consider the periodic channel model shown in figure 1 with boundary blowing/suction
and boundary shear measurement. The Reynolds number considered is $Re = 10{,}000$. The
total length of the channel is $L = 4\pi$, leading to a fundamental wavenumber, $\alpha_0 = 0.5$; see [4].
Recall, only integral multiples of this fundamental wavenumber may exist in the periodic
channel. For the single wavenumber model, only one wavenumber is included in the model,
corresponding to $\alpha = 2\alpha_0 = 1.00$. This wavenumber is selected since it is the only wavenumber
that leads to unstable modes for this channel geometry, as can be seen from the linear
stability curve of the Orr-Sommerfeld equation.

Figure 1: System model for Poiseuille channel flow. The flow is periodic with the period
of $4\pi$. Input is applied along the bottom plate as $w(x) = \sin(x)$ and shear is measured at $\pi$.
$Re = 10{,}000$, $\alpha = 1.00$.

Input is distributed along the entire bottom plate with a sinusoidal weighting function.
In terms of the input function, $w(x)$, of (12),

$$w(x) = \sin(x) \tag{35}$$

This type of distributed input has very favorable properties. It will be shown in section 4 that
a distributed input of this type leads to a system in which all modes are uncontrollable except
those associated with the wavenumber of $w(x)$. In this case, the wavenumber of $w(x) = \sin(1.0x)$
is $\alpha = 1.0$. Therefore, all modes resulting from all wavenumbers other than 1.0
are uncontrollable. This will allow us to consider only those poles and zeros associated with
$\alpha = 1.0$ since the control will affect these modes only. Note that the physical blowing/suction is
$v(x, y=-1, t) = -q(t)\,\frac{\partial w(x)}{\partial x}\,f(y=-1) = -q(t)\cos(x)\,f(y=-1)$, from (21). The $f(y)$ function in the
input is chosen as in (20). Exactly as in [2], a state variable model is created of this system.
In order to visualize the control theoretic model, the $A$, $B$, and $C$ matrices are transformed
to transfer function form, $H(s)$; see (34). In [2] it was shown that the $A$ matrix can be block
decomposed by wavenumbers included and not included in the model. Figure 2 shows the
locations of the poles and zeros in the s plane for the channel flow system of figure 1. The
numerical verification of these poles and zeros was described in [2].

Figure 2: Pole and zero locations in the s plane for the channel flow system of figure 1:
shear sensor at $\pi$, $w(x) = \sin(x)$, $L = 4\pi$, $\alpha = 1.0$. Only the top half of the s plane is shown.

3  Linear  Quadratic  Optimal  Control  Design

In order to design an optimal controller, the first step is to define in what sense the controller
is optimal. Consider the standard finite-dimensional state space formulation, derived for the
channel flow problem in [2] and shown in transfer function form in Figure 2,

$$\frac{dx(t)}{dt} = Ax(t) + Bu(t) \tag{36}$$

$$z(t) = Cx(t) \tag{37}$$

$$x_0 = x(t=0) \tag{38}$$

where the state vector $x(t)$ is $(n \times 1)$, the input vector $u(t)$ is $(1 \times 1)$, the output vector, $z(t)$,
is $(1 \times 1)$, the dynamic matrix, $A$, is $(n \times n)$, the input influence matrix, $B$, is $(n \times 1)$, and
the output influence matrix, $C$, is $(1 \times n)$. In optimal control design, an optimal controller is
defined as a controller which produces an input signal, $u(t)$, that minimizes a cost functional
$J(u)$ such that

$$J(u_{opt}) \le J(u) \quad \forall\, u(\cdot) \in \mathcal{U} = \left\{ u(\cdot) : u(\cdot)\ \text{piecewise-continuous},\ \int_0^{\infty} \| u(t) \|^2\,dt < \infty \right\} \tag{39}$$

In the following, we will assume that the model of (36)-(38) is reduced by removing all
uncontrollable and unobservable modes. Therefore, $(A,B)$ is controllable and $(A,C)$ is
observable in the reduced model.


3.1  Infinite  Optimal  Control  Time  Problem 


Consider a cost functional

$$J(u) = \lim_{T\to\infty}\frac{1}{T}\int_0^T \big([Rx(t),x(t)] + [\Lambda u(t),u(t)]\big)\,dt \tag{40}$$

where $R$ is a semi-positive-definite matrix and $\Lambda$ is a positive-definite matrix which are
defined a priori. The $R$ matrix defines the penalty on the state and the $\Lambda$ matrix defines
the penalty on the control. We may show that the optimal control is given by^1

$$u_{opt}(t) = -\Lambda^{-1}B^{*}Px(t) \tag{41}$$

where the matrix, $P$, is given as the positive-definite solution of the algebraic Riccati equation
(A.R.E.),

$$PA + A^{*}P - PB\Lambda^{-1}B^{*}P + R = 0 \tag{42}$$

and $B$ is the input influence matrix in the state space system (36). Consider the plant
equation (36) with the optimal control (41) applied,

$$\frac{dx}{dt} = (A - B\Lambda^{-1}B^{*}P)x \tag{43}$$

The closed loop system is stable iff the eigenvalues of the matrix $(A - B\Lambda^{-1}B^{*}P)$ are stable.
It can be shown [5] that if the pair $(A,B)$ is controllable^2 and $(A,R)$ is observable^3, then there
exists $P > 0$ such that

$$PA + A^{*}P - PB\Lambda^{-1}B^{*}P + R = 0$$

for $R \ge 0$.

^1 In the following, $*$ will denote the complex conjugate transpose operation.
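For a scalar plant the Riccati equation (42) reduces to a quadratic that can be solved by hand, which makes the structure of the optimal feedback easy to see. The sketch below is a minimal illustration with hypothetical numbers; the function `scalar_lqr` and its arguments are names introduced here, with `r` and `lam` standing in for the state and control weights.

```python
import math


def scalar_lqr(a, b, r, lam):
    """Solve the scalar form of (42), 2*a*p - (b**2/lam)*p**2 + r = 0, for p > 0.

    Returns (p, k, a_cl) with feedback u = -k*x and closed-loop pole a_cl.
    """
    root = math.sqrt(a * a + b * b * r / lam)
    p = lam * (a + root) / (b * b)      # positive root of the quadratic
    k = b * p / lam                     # scalar version of Lambda^{-1} B* P
    a_cl = a - b * k                    # scalar version of A - B Lambda^{-1} B* P
    return p, k, a_cl


# Hypothetical unstable plant: dx/dt = 1.0*x + 1.0*u, with r = 3 and lam = 1.
p, k, a_cl = scalar_lqr(a=1.0, b=1.0, r=3.0, lam=1.0)
residual = 2.0 * 1.0 * p - p * p + 3.0   # left-hand side of the scalar Riccati equation
print(p, k, a_cl, residual)
```

The closed-loop pole works out to $-\sqrt{a^2 + b^2 r/\lambda}$, so it is stable even though the open-loop plant is not, and a larger state penalty or a smaller control penalty moves it further left.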


3.2  Prescribed  Degree  of  Controller  Stability 

In order to achieve a prescribed degree of stability [5] with an optimal controller, consider
the exponential cost functional

$$J(u) = \lim_{T\to\infty}\frac{1}{T}\int_0^T e^{2\gamma t}\big([Rx(t),x(t)] + [\Lambda u(t),u(t)]\big)\,dt \tag{44}$$

where the matrix $R$ is again semi-positive-definite, $\Lambda$ is again positive-definite and defined
a priori, and $\gamma$ is a positive scalar. We may show [5] that we may convert this problem into
a quadratic form and solve for a controller that places all eigenvalues of the closed loop
system to the left of $s = -\gamma$. Rewrite the cost functional as

$$J(u) = \lim_{T\to\infty}\frac{1}{T}\int_0^T \big([Re^{\gamma t}x(t), e^{\gamma t}x(t)] + [\Lambda e^{\gamma t}u(t), e^{\gamma t}u(t)]\big)\,dt \tag{45}$$

^2 A system is said to be controllable if and only if it is possible, by means of the input, to transfer the
system from any initial state $x(t) = x_t$ to any other state $x_T = x(T)$ in a finite time $T - t > 0$. For a linear,
time invariant system described in state space, the system is controllable if its controllability Gramian,
given by $P(T) = \int_0^T e^{At}BB^{*}e^{A^{*}t}\,dt$, is invertible. [6]

^3 An unforced system is said to be observable if and only if it is possible to determine any (arbitrary
initial) state $x(t) = x_t$ by using only a finite record, $z(\tau)$ for $t \le \tau \le T$, of the output. For a linear, time
invariant system described in state space, the system is observable if its observability Gramian, given by
$M(T) = \int_0^T e^{A^{*}t}C^{*}Ce^{At}\,dt$, is invertible. [6]

Define

$$\tilde{x}(t) = e^{\gamma t}x(t) \tag{46}$$

and

$$\tilde{u}(t) = e^{\gamma t}u(t) \tag{47}$$

Then

$$J(u) = \lim_{T\to\infty}\frac{1}{T}\int_0^T \big([R\tilde{x},\tilde{x}] + [\Lambda\tilde{u},\tilde{u}]\big)\,dt \tag{48}$$

Note

$$\frac{d\tilde{x}}{dt} = \frac{d\left(e^{\gamma t}x\right)}{dt} = \gamma e^{\gamma t}x + e^{\gamma t}\frac{dx}{dt} \tag{49}$$

By substituting for $dx/dt$,

$$\frac{d\tilde{x}}{dt} = (A + I\gamma)\tilde{x} + B\tilde{u} \tag{50}$$

Let $\tilde{A} = A + I\gamma$. Then the optimal control is given by

$$\tilde{u}_{opt}(t) = -\Lambda^{-1}B^{*}P\tilde{x}(t) \tag{51}$$

and

$$u_{opt}(t) = -\Lambda^{-1}B^{*}Px(t) \tag{52}$$

where $P$ is obtained by solving the Riccati equation

$$P\tilde{A} + \tilde{A}^{*}P - PB\Lambda^{-1}B^{*}P + R = 0 \tag{53}$$

The new closed loop system dynamics matrix becomes

$$A - B\Lambda^{-1}B^{*}P \tag{54}$$

whose eigenvalues are all to the left of $s = -\gamma$. It can be shown [4] that if $(A,B)$ is
controllable and $(A,R)$ is observable, then $(A + \gamma I, B)$ is controllable and there exists a
positive definite solution, $P$, of the matrix Riccati equation (53) for $R \ge 0$.

In theory, for full order plant models, there is no restriction on the value of $\gamma$. In practice,
however, for controllers built using reduced order plant models, $\gamma$ is limited by robustness
of the controller when applied to the full order system.
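The shift $A \to A + \gamma I$ can be tried out numerically. The sketch below is one possible illustration, assuming NumPy: it solves the Riccati equation through the stable invariant subspace of the Hamiltonian matrix (a standard technique, swapped in here for whatever solver the original work used) on a hypothetical 2-state plant with made-up numbers, then checks that the closed-loop poles of the unshifted plant lie to the left of $s = -\gamma$. The helper name `solve_care` is an assumption of this sketch.

```python
import numpy as np


def solve_care(A, B, R, Lam):
    """Stabilizing solution P of  P A + A^T P - P B Lam^{-1} B^T P + R = 0.

    Uses the stable invariant subspace of the Hamiltonian matrix (real data).
    """
    n = A.shape[0]
    G = B @ np.linalg.solve(Lam, B.T)
    H = np.block([[A, -G], [-R, -A.T]])
    eigvals, eigvecs = np.linalg.eig(H)
    stable = eigvecs[:, eigvals.real < 0.0]      # n columns when a solution exists
    X1, X2 = stable[:n, :], stable[n:, :]
    P = np.real(X2 @ np.linalg.inv(X1))
    return 0.5 * (P + P.T)                       # symmetrize against round-off


# Hypothetical 2-state plant with one unstable pole (illustrative numbers only).
A = np.array([[0.0, 1.0], [2.0, -1.0]])
B = np.array([[0.0], [1.0]])
R = np.eye(2)
Lam = np.array([[1.0]])
gamma = 1.5

# Design on the shifted matrix A + gamma*I, then apply the gain to the original A.
A_shifted = A + gamma * np.eye(2)
P = solve_care(A_shifted, B, R, Lam)
K = np.linalg.solve(Lam, B.T @ P)
closed_loop_poles = np.linalg.eigvals(A - B @ K)
riccati_residual = P @ A_shifted + A_shifted.T @ P - P @ B @ K + R
print(closed_loop_poles.real)
```

Because the gain stabilizes the shifted plant, every eigenvalue of `A - B @ K` has real part below `-gamma`, which is the prescribed degree of stability.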


3.3  Optimal  Estimator  Design

In the previous subsections, we derived an optimal controller. As can be seen from equation
(41), the control is always represented in terms of the current state, $x(t)$. These states,
however, are not available to us in the channel problem. Instead, we have access to shear
measurements at only one or several locations along the boundary. Therefore, we must
construct an observer to estimate the state, $x(t)$, from the measured shear outputs, $z(t)$. In
order to see how this is done, consider a slightly more complicated version of the state space
model shown earlier

$$\frac{dx(t)}{dt} = Ax(t) + Bu(t) + v(t) \tag{55}$$

$$z(t) = Cx(t) + w(t) \tag{56}$$

$$x_0 = x(t=0) \tag{57}$$

where $(A,B)$ is again assumed controllable and $(A,C)$ is again assumed observable. The
variables $v(t)$ and $w(t)$ are Gaussian, white process and measurement noises, respectively.
They are stochastic variables at any given time. As a result of their inclusion in the system
model, the state, $x(t)$, and the measurement, $z(t)$, become stochastic processes that may be
analyzed only in terms of probability.

Assumption 1 The noise processes $v(t)$ and $w(t)$ are white, Gaussian, of zero mean, independent
of each other, and have known covariances. The matrices $P_0$, $Q_e$, and $W_e$ are
positive-definite.

$$E[v(t)v^{*}(\tau)] = Q_e\,\delta(t-\tau), \qquad E[v(t)] = 0$$

$$E[w(t)w^{*}(\tau)] = W_e\,\delta(t-\tau), \qquad E[w(t)] = 0$$

$$E[x(t_0)] = m$$

$$E\big([x(t_0) - m][x(t_0) - m]^{*}\big) = P_0$$

where $E(\cdot)$ is an expectation operator.

It may be shown that a steady state estimator may be constructed that minimizes the error
covariance between the actual state, $x(t)$, and the estimated state, $\hat{x}(Z_t)$, where
$Z_t = \{z(l) : -\infty < l < t\}$, i.e. the measurement history.

$$\min_{\hat{x}(Z_t)} E\big([x(t) - \hat{x}(Z_t)][x(t) - \hat{x}(Z_t)]^{*}\big) \tag{58}$$

It can be shown [7] that the optimal estimate, in the sense of (58), is given by the conditional
expectation

$$\hat{x}(t) = \hat{x}(Z_t) = E\big(x(t) \mid Z_t\big) \tag{59}$$

where $E(\cdot \mid Z_t)$ is the conditional mean operator. In the linear case with Gaussian noises,
the structure of the estimator is

$$\frac{d\hat{x}(t)}{dt} = A\hat{x}(t) + Bu(t) + K_e\big(C\hat{x}(t) - z(t)\big) \tag{60}$$

where

$$K_e = -P_eC^{*}W_e^{-1} \tag{61}$$

and $P_e$ is calculated from a matrix Riccati equation

$$P_eA^{*} + AP_e - P_eC^{*}W_e^{-1}CP_e + Q_e = 0 \tag{62}$$

Note that the stability of (60) depends on the stability of $(A - P_eC^{*}W_e^{-1}C)$. The assumptions
that $W_e, Q_e > 0$ and $(A,C)$ observable assure $\exists\, P_e > 0$ such that (62) is satisfied.
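A scalar version of (60)-(62) shows how the estimator gain stabilizes the estimation error even when the plant itself is unstable. The sketch below uses hypothetical numbers and a noise-free forward-Euler simulation; `scalar_estimator_gain`, `q_e` and `w_e` are names introduced for this illustration only.

```python
import math


def scalar_estimator_gain(a, c, q_e, w_e):
    """Scalar form of (61)-(62): returns (p_e, k_e) with p_e > 0."""
    root = math.sqrt(a * a + c * c * q_e / w_e)
    p_e = w_e * (a + root) / (c * c)   # positive root of 2*a*p - (c**2/w_e)*p**2 + q_e = 0
    k_e = -p_e * c / w_e               # sign convention of (61)
    return p_e, k_e


# Hypothetical scalar plant dx/dt = a*x with measurement z = c*x (noise-free run).
a, c = 1.0, 1.0
p_e, k_e = scalar_estimator_gain(a, c, q_e=8.0, w_e=1.0)
error_pole = a + k_e * c               # scalar version of A - P_e C* W_e^{-1} C

# Forward-Euler simulation of the plant and the estimator (60) with u = 0.
dt, steps = 1.0e-3, 2000
x, x_hat = 1.0, 0.0                    # estimator starts with the wrong state
for _ in range(steps):
    z = c * x
    x, x_hat = x + dt * a * x, x_hat + dt * (a * x_hat + k_e * (c * x_hat - z))
final_error = x_hat - x
print(p_e, k_e, error_pole, final_error)
```

The state itself grows over the run, but the estimation error decays at the rate set by `error_pole`, which is the property the stability remark about $(A - P_eC^{*}W_e^{-1}C)$ refers to.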

3.4  Prescribed  Degree  of  Estimator  Stability 

By considering a slightly different estimator Riccati equation, we may constrain the closed
loop estimator poles to be stable to a prescribed degree. It can be shown [4] that if $(A,C)$
is observable, then $(A + \gamma I, C)$ is observable and there exists a positive definite solution of
the matrix Riccati equation

$$P_e(A + \gamma I)^{*} + (A + \gamma I)P_e - P_eC^{*}W_e^{-1}CP_e + Q_e = 0$$

for $Q_e > 0$, where $\gamma$ is a real, positive scalar. Then all eigenvalues of $(A - P_eC^{*}W_e^{-1}C)$ are
to the left of $s = -\gamma$.


3.5  Separation  Principle  in  LQG  Control 


The goal of linear quadratic Gaussian (LQG) design is to combine the results of deterministic
linear quadratic control theory and stochastic estimation theory to form an overall control
system. As we have seen, our system equations become stochastic with the addition of noise
terms. Therefore, in our controller design, we may no longer minimize a deterministic cost
functional. Rather, we now minimize the expected value of the cost functional,

$$E[J] = E\left(\lim_{T\to\infty}\frac{1}{T}\int_0^T \big([Rx(t),x(t)] + [\Lambda u(t),u(t)]\big)\,dt\right) \tag{63}$$

where $x(t)$ is now a stochastic process. It can be shown that the optimal control is now
expressed in terms of the estimated state,

$$u_{opt}(t) = -\Lambda^{-1}B^{*}P\hat{x}(t) \tag{64}$$

The complete LQG solution is then

$$\frac{dx(t)}{dt} = Ax(t) + Bu(t) + v(t) \quad \text{(plant)} \tag{65}$$

$$z(t) = Cx(t) + w(t) \quad \text{(observation)} \tag{66}$$

$$x_0 = x(t=0) \quad \text{(initial condition)} \tag{67}$$

$$\frac{d\hat{x}(t)}{dt} = A\hat{x}(t) + Bu(t) + K_e\big(C\hat{x}(t) - z(t)\big) \quad \text{(estimator)} \tag{68}$$

$$u_{opt}(t) = -\Lambda^{-1}B^{*}P\hat{x}(t) \quad \text{(feedback)} \tag{69}$$

where $P$ is the positive-definite solution of (42) and $K_e$ is given in (61).

We may show that the overall estimator/controller system is stable by stacking the state,
$x(t)$, and the error, $e(t) = \hat{x}(t) - x(t)$, into one vector and studying the dynamics of the new
system.

$$\frac{d}{dt}\begin{bmatrix} x(t)\\ e(t)\end{bmatrix} = \begin{bmatrix} A - B\Lambda^{-1}B^{*}P & -B\Lambda^{-1}B^{*}P\\ 0 & A - P_eC^{*}W_e^{-1}C\end{bmatrix}\begin{bmatrix} x(t)\\ e(t)\end{bmatrix} + \begin{bmatrix} v(t)\\ -K_ew(t) - v(t)\end{bmatrix} \tag{70}$$

The stability of the system is determined by the eigenvalues of the dynamical matrix. Clearly,
the eigenvalues are comprised of the eigenvalues of the closed loop controller, $(A - B\Lambda^{-1}B^{*}P)$,
and the eigenvalues of the closed loop estimator, $(A - P_eC^{*}W_e^{-1}C)$. We have already shown
that both of these matrices are stable (under appropriate assumptions). Therefore, the
overall control system is stable also. This is known as the separation principle in LQG
control.
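The block-triangular argument behind the separation principle can be checked numerically. The sketch below assumes NumPy and uses a hypothetical 2-state plant; the gains `K` and `K_e` are hand-picked by pole placement rather than computed from Riccati equations, since the eigenvalue argument holds for any gains with this structure.

```python
import numpy as np

# Hypothetical 2-state plant and gains (illustrative numbers only).
A = np.array([[0.0, 1.0], [2.0, -1.0]])
B = np.array([[0.0], [1.0]])
C = np.array([[1.0, 0.0]])
K = np.array([[8.0, 4.0]])           # stands in for Lambda^{-1} B* P
K_e = np.array([[-8.0], [-14.0]])    # stands in for -P_e C* W_e^{-1}

A_ctrl = A - B @ K                   # closed-loop controller matrix, poles at -2 and -3
A_est = A + K_e @ C                  # closed-loop estimator matrix, poles at -4 and -5

# Stacked dynamics of [x; e] with e = x_hat - x (noise terms omitted).
stacked = np.block([[A_ctrl, -B @ K],
                    [np.zeros((2, 2)), A_est]])

stacked_poles = np.sort(np.linalg.eigvals(stacked).real)
union_poles = np.sort(np.concatenate(
    [np.linalg.eigvals(A_ctrl), np.linalg.eigvals(A_est)]).real)
print(stacked_poles, union_poles)
```

The spectrum of the stacked matrix is the union of the controller and estimator spectra, so the two designs can be carried out independently as long as the model is exact.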


4  Effects  of  Unmodelled  Wavenumber  Dynamics  on 
the  LQG  Problem 


We have already seen that the separation principle in LQG control allows us to show that if
the controller and estimator are both stable, then the overall system is stable. We will see
in this section that this principle breaks down in the presence of unmodelled dynamics.

As described in [2], any finite-dimensional model is a reduced-order model for the infinite-dimensional
channel flow problem. In terms of poles and zeros studied earlier, more poles and
zeros exist in the system than are accounted for in the model and the subsequent controller
design. It is easy to imagine that an unmodelled pole could be drawn into the unstable half
of the s plane by a reduced-order controller. As a result, even though the designed controller
may stabilize the reduced-order plant, it may not stabilize the actual infinite-dimensional plant.
Consider the following partition of the state space model with noise terms added

$$\frac{d}{dt}\begin{bmatrix} x_m\\ x_u\end{bmatrix} = \begin{bmatrix} A_m & 0\\ 0 & A_u\end{bmatrix}\begin{bmatrix} x_m\\ x_u\end{bmatrix} + \begin{bmatrix} B_m\\ B_u\end{bmatrix}u(t) + \begin{bmatrix} v_m(t)\\ v_u(t)\end{bmatrix} \tag{71}$$

$$z = \begin{bmatrix} C_m & C_u\end{bmatrix}\begin{bmatrix} x_m\\ x_u\end{bmatrix} + w(t) \tag{72}$$

where the subscripts $m$ and $u$ represent the modelled and unmodelled parts of the system.
The process noise, $v$, and the measurement noise, $w$, are assumed to be Gaussian, independent,
zero mean, white noise processes such that

$$E[v(t)v^{*}(\tau)] = Q_e\,\delta(t-\tau) \tag{73}$$

$$E[w(t)w^{*}(\tau)] = W_e\,\delta(t-\tau) \tag{74}$$

The unmodelled part is meant to denote only the dynamics of wavenumbers left out of
the reduced order model. $A_u$ is assumed stable. Recall [2] that both $A_u$ and $A_m$ are of
infinite dimension; $A_u$ because of the infinite number of wavenumbers left out of the reduced
order model and $A_m$ because of the infinite number of poles for each of the finite number of
modelled wavenumbers.

Since we only know the modelled part of the system, we design an LQG controller/observer
based on that part. Minimize the expected value of a cost functional, $J$,

$$E[J] = E\left(\lim_{T\to\infty}\frac{1}{T}\int_0^T \big([Rx_m(t),x_m(t)] + [\Lambda u(t),u(t)]\big)\,dt\right) \tag{75}$$

where $R$ is any semi-positive definite matrix and $\Lambda$ is any positive definite matrix. The
optimal control is of the form

$$u_{opt}(t) = -\Lambda^{-1}B_m^{*}P\hat{x}_m(t) \tag{76}$$

where $\hat{x}_m(t)$ is the estimate of the modelled state and the matrix, $P$, is calculated by solving
the algebraic Riccati equation

$$A_m^{*}P + PA_m - PB_m\Lambda^{-1}B_m^{*}P + R = 0 \tag{77}$$

under the same assumptions as in (65)-(69) for the modelled parts.

Since we cannot obtain direct measurements of the current state, $x_m(t)$, we construct an
observer as described earlier

$$\frac{d\hat{x}_m(t)}{dt} = A_m\hat{x}_m(t) + B_mu(t) + K_e\big(C_m\hat{x}_m(t) - z(t)\big) \tag{78}$$

where the estimator gain

$$K_e = -P_eC_m^{*}W_e^{-1} \tag{79}$$

requires the solution of another matrix Riccati equation

$$P_eA_m^{*} + A_mP_e - P_eC_m^{*}W_e^{-1}C_mP_e + Q_e = 0 \tag{80}$$

where $Q_e$ and $W_e$ are the power spectral densities of the process and measurement noises of
the modelled modes.

Define the error between the estimated, modelled state, $\hat{x}_m(t)$, and the actual state,
$x_m(t)$, as $e_m(t)$. Then as in [8]

$$\frac{de_m(t)}{dt} = \frac{d\hat{x}_m(t)}{dt} - \frac{dx_m(t)}{dt} = \big[A_m - P_eC_m^{*}W_e^{-1}C_m\big]e_m(t) + P_eC_m^{*}W_e^{-1}C_ux_u(t) + P_eC_m^{*}W_e^{-1}w - v_m \tag{81}$$

where the unmodelled state acts as a forcing term.

In order to study the entire controller/observer system, stack the modelled state, the
modelled error, and the unmodelled state and consider the dynamics of the stacked system
(82).

$$\frac{d}{dt}\begin{bmatrix} x_m\\ e_m\\ x_u\end{bmatrix} = \begin{bmatrix} A_m - B_m\Lambda^{-1}B_m^{*}P & -B_m\Lambda^{-1}B_m^{*}P & 0\\ 0 & A_m - P_eC_m^{*}W_e^{-1}C_m & P_eC_m^{*}W_e^{-1}C_u\\ -B_u\Lambda^{-1}B_m^{*}P & -B_u\Lambda^{-1}B_m^{*}P & A_u\end{bmatrix}\begin{bmatrix} x_m\\ e_m\\ x_u\end{bmatrix} + \begin{bmatrix} v_m\\ -K_ew - v_m\\ v_u\end{bmatrix} \tag{82}$$

From the LQG theory of the past sections,

$$(A_m - B_m\Lambda^{-1}B_m^{*}P)$$

and

$$(A_m - P_eC_m^{*}W_e^{-1}C_m)$$

are stable. However, from (82), the overall system may not be stable due to the unmodelled
actuator influence, $B_u$, and sensor influence, $C_u$, matrices. Therefore, we have seen that in
the LQG framework, we cannot ensure overall stability unless the unmodelled parts of the
system are accounted for.
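A scalar instance of the stacked system (82) makes the loss of stability concrete. The sketch below assumes NumPy and uses hypothetical numbers: one unstable modelled mode, one stable unmodelled mode, and hand-picked gains `k` and `k_e` that stabilize the modelled controller and estimator on their own. The function names are introduced here for illustration.

```python
import numpy as np


def stacked_matrix(a_m, b_m, c_m, a_u, b_u, c_u, k, k_e):
    """Scalar instance of the stacked system (82) for states [x_m, e_m, x_u].

    k stands in for Lambda^{-1} B_m* P and k_e for -P_e C_m* W_e^{-1}.
    """
    return np.array([
        [a_m - b_m * k, -b_m * k,        0.0],
        [0.0,           a_m + k_e * c_m, -k_e * c_u],
        [-b_u * k,      -b_u * k,        a_u],
    ])


def max_real_part(M):
    """Largest real part among the eigenvalues of M."""
    return float(np.max(np.linalg.eigvals(M).real))


# Hypothetical numbers: unstable modelled mode, stable unmodelled mode.
params = dict(a_m=1.0, b_m=1.0, c_m=1.0, a_u=-1.0, k=3.0, k_e=-4.0)

spillover = max_real_part(stacked_matrix(b_u=1.0, c_u=1.0, **params))
no_control_spill = max_real_part(stacked_matrix(b_u=0.0, c_u=1.0, **params))
no_observation_spill = max_real_part(stacked_matrix(b_u=1.0, c_u=0.0, **params))
print(spillover, no_control_spill, no_observation_spill)
```

With both couplings present the stacked system has an eigenvalue in the right half plane, even though the controller pole (-2), the estimator pole (-3) and the unmodelled pole (-1) are each stable. Setting either `b_u` or `c_u` to zero makes the matrix block triangular and restores those three poles.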

There are two ways to ensure system (82) is stable, assuming $A_m + B_mKC_m$ and $A_u$
are stable. One way is to ensure $B_u = 0$, i.e., make sure the unmodelled dynamics are
uncontrollable with respect to the actuator. The other way to ensure a stable system is
to assure $C_u = 0$, i.e., make sure the unmodelled dynamics are unobservable with respect
to the sensor. Controllability and observability for the plane Poiseuille flow problem were
introduced in [2]. We now explore how we may achieve these conditions.

4.1  Point  Actuation  vs.  Distributed  Actuation

One way to guarantee the overall system (82) is stable is to assure all modes associated with
unmodelled wavenumbers are uncontrollable with respect to the input by making $B_u = 0$.
In the fully developed channel flow system, this would account for the wavenumbers left out
of the reduced-order model. If

$$w(x) = \mathrm{Real}\left(\sum_{n=-N}^{N} a_n e^{i n \alpha_0 x}\right) \tag{83}$$

where the $n$ range corresponds to the modelled wavenumbers only, then the projection of
$w(x)$ onto unmodelled wavenumbers is zero due to the orthogonality of Fourier components.
As a result, $B_u = 0$ and stability of unmodelled dynamics is retained. Note that since
$w(x) \neq 0$ for all but a finite number of points in the $x$ direction, this type of scheme is a
distributed actuation scheme. Therefore, by moving from a physically easier to implement
“point” actuator to a more difficult distributed actuator, we have retained stability of the
unmodelled dynamics. Physically, a distributed actuator is obtained by a large number of
independently programmable actuators placed along the lower wall. Constraining $w(x)$ to
be composed of modelled wavenumbers only, thus rendering $B_u = 0$, is manifested in the
transfer function as pole/zero cancellation due to uncontrollable modes. Figure 3 shows the
poles and zeros for a two wavenumber model, $\alpha = 0.5$ and $\alpha = 1.0$. The input forcing
function, $w(x)$, is $\sin(x)$. We see that all poles associated with $\alpha = 0.5$ are canceled by
zeros. This signifies that the modes are uncontrollable. Indeed, all poles of all wavenumbers
except $\alpha = 1.0$ are uncontrollable. If distributed actuation is infeasible or undesirable, we
must look to the dual problem of sensing to gain stability.

Figure 3: Pole (x) and zero (o) configuration of channel system of length $4\pi$, including
wavenumbers of 0.50 and 1.00; Channel model: $Re = 10{,}000$, shear sensor at $\pi$, $w(x) = \sin(x)$.
Only the top half of the s plane is shown.

4.2  Point  Sensing  vs.  Distributed  Sensing

It is seen from (82) that if $B_u \neq 0$, stability may still be maintained if $C_u = 0$. This
corresponds to making all unmodelled wavenumber dynamics unobservable with respect to
the shear sensor. Placing a single shear sensor at a point along the lower channel wall
results in a measurement that includes the effects of all wavenumbers, both modelled and
unmodelled. Clearly, $C_u \neq 0$, and stability is not guaranteed. This corresponds to the
point forcing case in the dual problem of actuation. By using a distributed sensing scheme,
however, we may form a new measurement that includes only the effects of the modelled
wavenumbers. This is done by projecting a distributed shear function, $z(x, y=-1, t)$, onto
the modelled wavenumbers. The distributed shear function, $z(x, y=-1, t)$, is physically
created by measuring the shear at all points along the lower channel wall. Then a new
projected shear measurement, denoted $\tilde{z}(t)$, is defined as

$$\tilde{z}(t) = \mathrm{Real}\left[\sum_{n=-N}^{N}\int_0^{L} z(x, y=-1, t)\,e^{-i n \alpha_0 x}\,dx\right] \tag{84}$$

where again the $n$ range corresponds to modelled wavenumbers only. Note that just as in
the actuator case, we have implemented a more physically complicated series of sensors in
order to achieve overall stability.
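The orthogonality argument used for both distributed actuation and distributed sensing can be illustrated with a short numerical projection. The sketch below is a minimal example in plain Python: the channel period $4\pi$ and the input $w(x) = \sin(x)$ follow the text, while the quadrature resolution `N`, the helper `fourier_amplitude` and the three-wavenumber `shear` signal are assumptions made up for the illustration.

```python
import math

L = 4.0 * math.pi            # channel period, as in the text
ALPHA0 = 2.0 * math.pi / L   # fundamental wavenumber, 0.5
N = 2000                     # quadrature points (assumed resolution)


def fourier_amplitude(func, n):
    """Amplitude of the wavenumber n*ALPHA0 component of func over one period."""
    dx = L / N
    xs = [(j + 0.5) * dx for j in range(N)]            # midpoint rule
    c = sum(func(x) * math.cos(n * ALPHA0 * x) for x in xs) * dx * 2.0 / L
    s = sum(func(x) * math.sin(n * ALPHA0 * x) for x in xs) * dx * 2.0 / L
    return math.hypot(c, s)


def w(x):
    """Distributed input weighting of (35)."""
    return math.sin(x)


def shear(x):
    """Hypothetical wall shear with three wavenumbers present."""
    return 0.7 * math.sin(0.5 * x) + 1.3 * math.sin(x + 0.4) + 0.2 * math.cos(1.5 * x)


input_amplitudes = {n: fourier_amplitude(w, n) for n in (1, 2, 3)}
projected_shear = fourier_amplitude(shear, 2)   # keeps only alpha = 1.0
point_shear = shear(math.pi)                    # mixes all wavenumbers
print(input_amplitudes, projected_shear, point_shear)
```

The input $\sin(x)$ has a non-zero projection only on $\alpha = 1.0$ (index 2), which is why the other wavenumbers are not excited by it. In the same way, projecting the distributed shear onto $\alpha = 1.0$ recovers only that component, whereas the point value at $x = \pi$ contains contributions from every wavenumber present.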

In terms of modeling, we need not include either unobservable or uncontrollable modes
in our plant models. Therefore, distributed actuation or sensing allows models to be created
using only a finite number of wavenumbers. Note, however, that even a single wavenumber
model contains an infinite number of modes due to the infinite number of poles extending
out into the left hand s plane as shown in Figure 2 (see also [2]).


5  Control  Design  Using  High  Order  Models 

As we have seen in section 4, we may reduce the problem of including an infinite number
of wavenumbers in a reduced-order model to a problem of including a finite number of
wavenumbers by using distributed actuation or sensing. However, even with a model containing
only a finite number of wavenumbers, the problem is still infinite-dimensional because
of the infinite number of poles extending into the left hand s plane for each wavenumber.
Furthermore, we do not know the exact position of poles far into the left hand s plane due
to the finite number of basis functions used in the $y$ direction [4]. Still, these poles must be
accounted for in the control design. As we shall see, the fact that uncertain poles appear
only at higher frequencies in the bandwidth will be advantageous. It will allow a robust
controller to be designed that “rolls off” at high frequencies. Just as in [5], consider the
control loop shown in figure 4. Using the same notation as in [5],


Figure 4: Feedback control loop. $P(s)$ represents the plant and $C(s)$ represents the controller
in the frequency domain. This figure is taken from [5].

•  r = external (reference) input

•  e = measured tracking error

•  u = plant input

•  y = plant output

•  d = disturbance signal

•  n = measurement noise

Note that the reference input in the channel flow case is zero since we would like to suppress
all output. The following relations hold for the closed loop system [5].

$$y = PC(I + PC)^{-1}(r - n) + (I + PC)^{-1}d \tag{85}$$

$$e = r - y - n = (I + PC)^{-1}(r - d) - (I + PC)^{-1}n \tag{86}$$

We assume that the closed loop system is stable. The quantity

$$S = (I + PC)^{-1} \tag{87}$$

is called the sensitivity function and the quantity

$$T = PC(I + PC)^{-1} \tag{88}$$

is called the complementary sensitivity function. Note

$$S + T = I \tag{89}$$

As can be seen from (86), for good tracking, i.e. $\| r - y \|$ small when $d$ and $n$ are zero, $S$ should
be small [5]. This corresponds to having a high gain at all frequencies. Similarly, as can be
seen from (86), for good disturbance rejection, i.e. $d$ affects $y$ as little as possible, again $S$
should be small [5]. On the other hand, for good output noise suppression, i.e. $n$ affects $y$ as
little as possible, (85) shows that $T$ should be small. This indicates that the gain should
be low at all frequencies in which the noise enters [5]. It is generally assumed that noise is
most destructive at higher frequencies. As a result, control design focuses on high gain at
low frequencies where tracking and disturbance rejection are most important and low gain
at high frequencies where noise is more of a problem.
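The trade-off between $S$ and $T$ can be seen on a simple scalar loop. The sketch below uses a hypothetical plant and controller (made-up transfer functions, not the channel flow model) with high loop gain at low frequency and roll-off at high frequency; `loop_gain` and `sensitivities` are names introduced for this illustration.

```python
def loop_gain(s):
    """Hypothetical open-loop series P(s)C(s) with P = 1/(s + 1) and C = 100/(s + 10)."""
    return (1.0 / (s + 1.0)) * (100.0 / (s + 10.0))


def sensitivities(omega):
    """Return (S, T) of (87)-(88) for the scalar loop at s = j*omega."""
    pc = loop_gain(1j * omega)
    S = 1.0 / (1.0 + pc)
    T = pc / (1.0 + pc)
    return S, T


for omega in (0.01, 1.0, 1000.0):
    S, T = sensitivities(omega)
    print(f"omega = {omega:8.2f}  |S| = {abs(S):.4f}  |T| = {abs(T):.4f}  |S + T| = {abs(S + T):.4f}")
```

At low frequency the loop gain is large, so $|S|$ is small and $|T|$ is close to one; at high frequency the loop rolls off, so $|T|$ is small and $|S|$ is close to one. The identity $S + T = 1$ holds at every frequency, which is why both cannot be made small at once.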

In  addition  to  noise  at  high  frequencies,  the  other  major  problem  at  high  frequencies  is 
unmodelled  dynamics.  We  have  already  pointed  out  that  there  are  two  types  of  unmodelled 
dynamics  in  the  channel  flow  problem.  The  first  type  is  unmodelled  dynamics  of  unmodelled 
wavenumbers.  We  accounted  for  these  dynamics  through  distributed  control  or  distributed 
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sensing.  The  second  type  is  unmodelled  dynamics  at  high  frequencies  for  modeled  wavenum¬ 
bers.  This  type  of  unmodelled  dynamics  has  yet  to  be  considered  and  is  common  to  most 
infinite  dimensional  systems.  To  account  for  these  dynamics,  controllers  are  designed  that 
give  low  gain  at  the  high  frequencies  of  the  open-loop,  controller/plant  series  where  unmod¬ 
elled  dynamics  exist  in  order  not  to  stimulate  modes  at  those  frequencies.  This  is  known  as 
“roll-off”.  Roll-off  may  be  given  a  more  analytic  framework  by  considering  multiplicative, 
unstructured  uncertainty  in  the  plant,  P.  Suppose  that  we  are  uncertain  of  the  exact  form 
of  P.  Assume  that  P  can  be  perturbed  to 

Pn{s)  =  [/  +  N{s)\P{s)  (90) 

This  type  of  uncertainty  is  generally  prescribed  to  unmodelled  high  frequency  dynamics. 
It  has  been  shown  [9]  that,  in  this  case,  T  should  be  low  in  order  to  preserve  closed  loop 
stability.  Again,  this  corresponds  to  “roll-off”  of  the  open  loop  controller/plant  series. 

We now design an LQG controller and then observe the roll-off characteristics of the
resulting system. In the following, γ will refer to the degree of closed loop stability in
both the LQ controller and the observer as shown in (44). R is the state weighting matrix
in the LQ cost functional (75), A is the input weighting matrix in the LQ cost functional
(75), Q_e is the power spectral density matrix of the process noise (62), and W_e is the power
spectral density matrix of the measurement noise (62).

We consider the one wavenumber model shown in figure 1 with Re = 10,000. Only
α = 1.0 is included in the model. All other wavenumbers are uncontrollable due to the
distributed input of w(x) = sin(x) as shown in section 4.1. A single point sensor is located
at π. The length of the channel is 4π, leading to a fundamental wavenumber of α_0 = 1/2. We
consider two models in evaluating the resulting LQG controller as shown in table 1.

Model 1 contains all poles and zeros to the right of s = -4. Model 2 contains all observable
and controllable poles and zeros to the right of s = -2. This model was created by using
the minreal function within the MATLAB Control Toolbox [10] with the parameter value
tol=1e-3. The lqr and lqe functions of the MATLAB Control Toolbox were used to create
an LQG controller using model 2 as the system model. The A matrix supplied to each of
these MATLAB functions was modified to (A + γI) in order to achieve a prescribed degree

Models

Number   Name                     Poles/Zeros
1        high-order               253/251
2        slightly-reduced-order   141/139

Table 1: Models used in LQG controller design. Note that the difference in number of poles
and zeros is dictated by truncation of the model in s space.

of stability as described in sections 3.2 and 3.4. The following parameters were used:

\begin{align}
\gamma &= -0.08 \tag{91}\\
R &= 0.001\,C^{*}C \tag{92}\\
A &= I \tag{93}\\
Q_e &= 10\,BB^{*} \tag{94}\\
W_e &= 1 \tag{96}
\end{align}

The R matrix was chosen so as to minimize shear in the cost functional (63); the Q_e matrix
was originally chosen to recover robustness properties using loop transfer recovery techniques
[11]; γ, W_e, and A were chosen using trial and error.
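
The effect of designing on a shifted dynamic matrix can be illustrated on a scalar plant, where the Riccati equation has a closed-form solution. The sketch below is only an illustration under stated assumptions: the plant values are hypothetical, the shift is written as a positive margin `alpha` added to the plant pole, and the sign convention may differ from the one used for γ in sections 3.2 and 3.4.

```python
import math


def scalar_lqr_gain(a, b, q, r):
    """LQR gain k for dx/dt = a*x + b*u with cost integral of q*x**2 + r*u**2.

    The stabilizing root of the scalar Riccati equation
    2*a*p - (b*p)**2 / r + q = 0 is p = r*(a + sqrt(a**2 + b**2*q/r)) / b**2.
    """
    p = r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)
    return b * p / r


def closed_loop_pole(a, b, k):
    """Pole of the closed loop dx/dt = (a - b*k)*x."""
    return a - b * k


# Hypothetical unstable scalar plant and weights (not taken from the paper).
a, b, q, r = 0.05, 1.0, 0.001, 1.0
alpha = 0.08  # desired stability margin: closed-loop pole left of -alpha

k_plain = scalar_lqr_gain(a, b, q, r)
k_shift = scalar_lqr_gain(a + alpha, b, q, r)  # design on the shifted plant

pole_plain = closed_loop_pole(a, b, k_plain)
pole_shift = closed_loop_pole(a, b, k_shift)   # gain applied to the true plant
```

In the scalar case the shifted design gives a pole at -alpha - sqrt((a + alpha)**2 + b**2*q/r), which is always to the left of -alpha, whereas the unshifted design only guarantees a pole in the left half-plane.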

In analyzing the resulting control system, consider the model of figure 5. The
optimal control is defined at K = 1 with a properly designed LQG controller. Figure 6 shows
the high-order plant (model 1) used for validation. Figure 7 shows the slightly-reduced-order
model (model 2). Figure 8 shows a series combination of the high-order plant (model 1) and
the low-order controller obtained from the LQG design. Finally, figure 9 shows the root locus
of the controller/estimator in series with the plant for gain values, K, varying from 0 to 4.

Figure 5: LQG control loop for high-order plant (model 1) and reduced order controller
(obtained from model 2). The value of K=1 will lead to the optimal control.


Figure  6:  High-order  plant  (model  1)  contains  all  poles  and  zeros  to  the  right  of  s=-4.  (Only 
poles/zeros  to  the  right  of  s=-3  are  shown.)  Note  many  poles  are  either  uncontrollable 
or  unobservable  as  shown  by  pole/zero  cancellations.  These  poles  cannot  be  moved.  We 
concentrate  on  moving  only  the  observable/controllable  poles. 

From figure 9, we see that the poles of the closed loop system achieve the goal of being
to the left of s = -.08 with gain K = 1. This places the closed loop poles approximately
twice as far into the left half-plane as the integral
feedback control scheme of [2]. Finally, figure 10 shows the magnitude response for the open
loop transfer function from point 1 on figure 5 to point 2. Note that the gain is higher at low
frequencies and lower at higher frequencies. At f = 1.0, the magnitude response “rolls-off”.
Ideally, we would like to see roll-off at a lower frequency. In this case, however, any energy
input at frequencies above f = 1 did not hamper our stability goals.




Figure  7:  Reduced  order  plant  used  for  LQG  design.  This  model  contains  only  controllable 
and  observable  poles  and  zeros. 


Figure  8:  Series  connection  of  high-order  plant  (model  1)  of  Figure  6  and  LQG  controller 
obtained  from  reduced  order  model  (model  2)  of  Figure  7. 


6  Suboptimal  Control 
Although we achieved our goal in section 5, we designed our LQG controller with a very
high dimensional slightly-reduced-order plant model (model 2). This may lead to numerical
problems if the design were attempted with a new model that contained several wavenumbers,
as the model would be even larger. As a result, we would like to develop a method that
uses extremely low-order models in the LQG design. By examining figure 8, we observe
that most of the poles and zeros of the high-order plant appear to be inverted¹ by controller
poles and zeros; i.e. controller poles are placed on top of plant zeros and vice-versa. In
fact, these poles and zeros are not actually inverted by the optimal controller, but nearly so.
A suboptimal controller, then, may be designed that a priori inverts most of the poles and
zeros, except for a few. Only the poles and zeros that are not inverted are used in the LQG
design procedure. This scheme is shown in figure 11.


Figure  11:  LQG  control  loop  for  high-order  plant  (model  1),  inverted  model  (model  3),  and 
reduced-order  controller  obtained  from  model  4. 

We  define  a  few  new  models  to  use  as  part  of  this  scheme  as  shown  in  table  2.  Model  3 
contains  the  poles  and  zeros  used  to  invert  most  of  the  poles  and  zeros  in  model  2.  Model  4 
contains  the  poles  and  zeros  that  are  not  inverted. 
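
The bookkeeping behind models 3 and 4 can be sketched as follows. This is a minimal illustration, not the authors' procedure: the function names and the small plant are hypothetical, and cancellation is treated as exact. It shows that the inverting model carries one zero per cancelled plant pole and one pole per cancelled plant zero, which agrees with the counts in the tables (141 - 9 = 132 poles and 139 - 7 = 132 zeros cancelled).

```python
def partial_inverse(poles, zeros, keep_poles, keep_zeros):
    """Split a plant's poles/zeros into an inverting model and a retained model.

    The inverting model places a zero on every plant pole that is not kept
    and a pole on every plant zero that is not kept.
    """
    cancel_poles = [p for p in poles if p not in keep_poles]
    cancel_zeros = [z for z in zeros if z not in keep_zeros]
    # Inexact cancellation is only tolerable in the open left half-plane.
    if any(complex(s).real >= 0 for s in cancel_poles + cancel_zeros):
        raise ValueError("only left-half-plane poles and zeros may be inverted")
    inverse_model = {"poles": cancel_zeros, "zeros": cancel_poles}
    retained_model = {"poles": list(keep_poles), "zeros": list(keep_zeros)}
    return inverse_model, retained_model


def series(first, second):
    """Series connection, removing exactly matching pole/zero pairs."""
    poles = list(first["poles"]) + list(second["poles"])
    zeros = list(first["zeros"]) + list(second["zeros"])
    for z in list(zeros):
        if z in poles:
            poles.remove(z)
            zeros.remove(z)
    return {"poles": poles, "zeros": zeros}


# Hypothetical plant: one lightly damped pair is kept for the LQG design,
# everything else is inverted a priori.
plant = {
    "poles": [-0.05 + 0.25j, -0.05 - 0.25j, -0.5, -1.0, -1.5],
    "zeros": [-0.6, -1.1, -2.0],
}
keep_poles = [-0.05 + 0.25j, -0.05 - 0.25j]
keep_zeros = []

inverse_model, retained_model = partial_inverse(
    plant["poles"], plant["zeros"], keep_poles, keep_zeros
)
reduced = series(plant, inverse_model)
```

After the series connection only the retained pair is left, so the LQG synthesis sees a two-pole model even though the implemented controller still includes the inverting model.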

Note that pole/zero cancellation is a bad idea in the right hand s plane. If a non-minimum
phase zero is not exactly canceled, a new instability may be created in the system. In fact,
we can never actually invert any pole or zero because we can never know the exact locations
of the poles or zeros. However, non-exact cancellation is not a problem in this case since
all poles and zeros we cancel are in the left hand s plane. The slightly-reduced-order model
is shown in figure 7. Our approach is to a priori invert most of the poles and zeros with
model 3. The few poles and zeros that are not inverted (model 4) are shown in figure 12.
An LQG controller is designed using only these poles and zeros (model 4). The parameters
for the design are the same as in section 5. Figure 13 shows the transfer function of the
series connection of high-order model (model 1), partial inverted model (model 3), and
low-order controller obtained from model 4. The root locus of the system is shown in figure
14. By comparing the root loci shown in figures 9 and 14, we see that the closed loop
pole placement is very similar in both cases. Indeed, the inverted model controller scheme
achieves the objective of all observable/controllable poles to the left of s = -.08.

¹ A pole or zero is said to be inverted if a zero (for the pole) or pole (for the zero) is placed
in the same position.

Models

Number   Name               Poles/Zeros
3        partial-inverted   132/132
4        very-low-order     9/7

Table 2: Models used in low-order LQG controller design.

Figure 12: Very-low-order model (model 4).

There are a few points that should be noted. First, the controllers designed for the
two cases are not the same, even though they are similar. This can be particularly noted by
observing the open loop controller poles on the real axis. In the slightly-reduced-order-model
case, figure 9, one pole is between s = .5 and s = .6, while in the partial-inverted-model case,
figure 13, one pole is between s = .6 and s = .7. Second, the inverted model scheme does
not reduce the overall order of the controller that must be implemented. The partial-inverted
model (model 3) as well as the LQG controller must be considered together as the overall
controller. The order of this combination is the same as that of the original
slightly-reduced-order LQG controller. However, the inverted scheme does allow a very-low-order
model to be used in the LQG design synthesis.

Figure 14: Root locus of series connection shown in figure 13

Finally, figure 15 shows the magnitude response for the open loop transfer function from
point 1 on figure 11 to point 2. Note that, just as in the previous section, the gain is higher at
low frequencies and lower at higher frequencies. Again, at f = 1.0, the magnitude response
“rolls-off”.


Figure  15:  Magnitude  response  of  open  LQG  control  loop  for  inverted  model  scheme. 

In  order  to  investigate  the  robustness  of  the  inverted  model  scheme  for  varying  Reynolds 
numbers,  we  apply  the  inverted-model/very-low-order  LQG  controller  constructed  for  figure 
1  with  Re  =  10,000  to  a  single-wavenumber  system  for  Re  =  12,500  (figure  16).  The 
root  locus  for  the  higher  Reynolds  number  system  with  inverted  scheme  controller  is  shown 
in  figure  17.  It  can  be  seen  that  although  the  system  is  stabilized,  the  objective  of  all 
observable/controllable poles to the left of s = -.08 is not achieved. As may have been
expected,  the  inverted  model  scheme  is  not  robust  in  terms  of  degree  of  stability.  However, 
overall  stability  is  preserved. 


Figure 16: Pole (x) / Zero (o) configuration, channel model: Re = 12,500, shear sensor at
π, w(x) = sin(x), L = 4π, α = 1.0. Only the top half of the s plane is shown.


Figure 17: Root locus of series connection of partial-inverted model (model 3), low-order
controller obtained from model 4, and Re=12,500 system model of figure 16. It is seen that
the original pole near s = -.05 + .25i never moves to the left of s = -.08.


7  Conclusion 
In this paper, we considered linear stabilization of plane Poiseuille flow using linear quadratic
Gaussian (LQG) optimal control theory. In prior investigations, it was shown that plane
Poiseuille flow may be stabilized using extremely simple integral compensator controllers,
given that sensor locations produced minimum phase zeros. However, the controller was
marked by slow dissipation of perturbation energy. In this investigation, we exploited LQG
methods to give faster perturbation energy dissipation. We showed that we may achieve
approximately double the dissipation rate reported for integral compensator control
schemes. This increase in performance came with a cost. Controllers built using LQG
methods  for  infinite-dimensional  systems  must,  in  theory,  be  infinite-dimensional.  Since  it 
is  impossible  to  implement  infinite-dimensional  controllers,  we  implemented  reduced-order 
controllers  on  the  full-order  plant.  We  showed  that  this  procedure  can  in  theory  lead  to 
de-stabilization  of  unmodelled  dynamics.  We  then  showed  that  this  may  be  avoided  using 
distributed  control  or,  dually,  distributed  sensing.  An  additional  problem  in  high-plant- 
order  LQG  controller  design  is  numerical  inaccuracy  in  the  synthesis  equations.  We  showed 
a sub-optimal LQG synthesis method that uses an extremely low-order plant model for
synthesis  of  a  high-order  controller.  This  involved  the  use  of  a  left-hand-plane  inversion 
model.  This  sub-optimal  controller  produced  results  essentially  equivalent  to  the  optimal 
controller.  Finally,  we  applied  our  LQG  controller  to  a  plant  of  Reynolds  number  12,500 
instead  of  10,000  used  in  the  LQG  design.  Using  this  new  plant,  we  showed  that  the  system 
was  still  stabilized.  However,  the  goal  of  double  dissipation  rate  was  not  achieved,  showing 
the  inherent  non-robustness  (in  terms  of  degree  of  stabilization)  of  this  controller.  In  this 
paper,  we  have  examined  LQG  control  methods.  Other  control  approaches  exist  that  are 
based  on  worst-case  design.  These  approaches  include  disturbance  attenuation  design  [12] 
and H∞ design [13]. In the future, these designs may lead to enhanced performance in fluid
control  problems. 
A systems theory framework is presented for the linear stabilization of two-dimensional
laminar plane Poiseuille flow. The governing linearized Navier-Stokes equations
are converted to control-theoretic models using a numerical discretization scheme.
Fluid system poles, which are closely related to Orr-Sommerfeld eigenvalues, and
fluid system zeros are computed using the control-theoretic models. It is shown
that the location of system zeros, in addition to the well-studied system eigenvalues,
is important in linear stability control. The location of system zeros determines the
effect  of  feedback  control  on  both  stable  and  unstable  eigenvalues.  In  addition,  system 
zeros  can  be  used  to  determine  sensor  locations  that  lead  to  simple  feedback  control 
schemes.  Feedback  controllers  are  designed  that  make  a  new  fluid-actuator-sensor- 
controller  system  linearly  stable.  Feedback  control  is  shown  to  be  robust  to  a  wide 
range  of  Reynolds  numbers.  The  systems  theory  concepts  of  modal  controllability  and 
observability  are  used  to  show  that  feedback  control  can  lead  to  short  periods  of  high- 
amplitude  transients  that  are  unseen  at  the  output.  These  transients  may  invalidate  the 
linear model, stimulate nonlinear effects, and/or form a path of ‘bypass’ transition in a
controlled  system.  Numerical  simulations  are  presented  to  validate  the  stabilization  of 
both  single-wavenumber  and  multiple-wavenumber  instabilities.  Finally,  it  is  shown 
that a controller designed upon linear theory also has a strong stabilizing effect on
two-dimensional  finite-amplitude  disturbances.  As  a  result,  secondary  instabilities  due 
to  infinitesimal  three-dimensional  disturbances  in  the  presence  of  a  finite-amplitude 
two-dimensional  disturbance  cease  to  exist. 


1.  Introduction 

A basic problem in fluid dynamics is the theoretical understanding of how instability
in laminar shear flow leads to transition to turbulence. Since laminar flow
is preferred in many applications, the suppression of fluid instabilities to maintain
laminar flow would be very useful. Towards this goal, active boundary layer control
of instability has been proposed. The nonlinear aspects of the transition process are
still not clearly known. It has been shown (Orszag & Patera 1983) that some shear
flows may sustain two-dimensional finite-amplitude instabilities that cause infinitesimal
three-dimensional disturbances to be highly unstable. This two-dimensional
primary instability/three-dimensional secondary instability process may be one nonlinear
mechanism that leads to transition. Conversely, the behaviour of infinitesimal
(linear) disturbances in laminar shear flow is well understood (Orr 1907; Sommerfeld
1908; Drazin & Reid 1981). Therefore, linear analysis provides a natural starting
point to begin to develop active control schemes that may eventually lead to full
transition control. It is the purpose of this paper to develop feedback controllers
based on linear theory that stabilize two-dimensional plane Poiseuille flow to
infinitesimal disturbances. In addition, it will be shown that a controller designed
upon linear theory has a strong stabilizing effect on two-dimensional finite-amplitude
instabilities. As a result, secondary three-dimensional instabilities as described in
Orszag & Patera (1983) cease to exist in such a system.

† Present address: Jet Propulsion Laboratory, 4800 Oak Grove Drive, Pasadena, CA 91109, USA.

Most  prior  work  in  the  area  of  laminar  flow  linear  instability  suppression  has 
concentrated  on  the  wave  superposition  approach.  A  nice  survey  of  past  work  is 
given  in  Joslin,  Erlebacher  &  Hussaini  (1994).  The  basic  idea  is  that  boundary  layer 
instabilities  appear  as  a  combination  of  many  sinusoidally  growing  waves  of  certain 
frequencies, phases, and amplitudes. If these wave parameters are known, or if they
can  be  determined,  a  second  wave  may  be  stimulated  in  the  flow  that  is  exactly  out 
of  phase  with  the  instability  wave.  In  this  way,  the  two  waves  may  destructively 
interfere  and  flow  may  be  stabilized.  Disturbance  of  the  flow  field  to  cause  a 
wave  motion  to  appear  has  been  experimentally  demonstrated  using  several  methods 
such  as  vibrating  ribbons  (Schubauer  &  Skramstad  1947),  vibrating  wires  (Milling 
1981),  and  heating  elements  (Nosenchuck  1982).  In  addition,  several  authors  have 
numerically  obtained  wave  superposition  results  based  on  Navier-Stokes  simulations 
(Biringen 1984; Metcalfe et al. 1986). Most of the results, however, report incomplete
destruction  of  instability  waves.  Joslin  et  al.  (1994)  explain  that  wave  cancellation 
is  very  sensitive  to  the  wave  parameters  and  postulate  that  incomplete  destruction 
reported  in  past  studies  was  due  to  improper  phase  or  amplitude  properties  of  the 
cancelling  wave. 

In addition, Bower, Kegelman & Pal (1987) considered the Orr-Sommerfeld equation
in designing an input to channel flow that may counteract the effects of a
disturbance that excites instabilities. They showed that if an oscillating flat pulse
function at the lower boundary of a channel excited inherent instabilities, a second
oscillating flat pulse function could be constructed downstream that negated excitation
of the instabilities. Much like the wave superposition approach, their aim was
not  to  stabilize  the  underlying  dynamics  of  the  problem,  but  rather  to  ‘cancel’  the 
effects  of  a  specific  disturbance. 

Unlike  past  work,  our  aim  is  to  use  systems  theory  to  construct  a  combination  fluid- 
actuator-sensor-controller  system  that  is  inherently  stable.  In  essence,  the  approach 
changes  the  philosophy  of  the  problem  from  thinking  about  how  inputs  can  mitigate 
an  inherently  unstable  system  to  thinking  about  how  sensors  and  actuators  can  be 
added  to  form  an  entirely  new  stable  system.  Recently,  mathematical  control  systems 
theory has begun to be applied to fluid systems (Burns & Ou 1994; Gunzburger,
Hou & Svobodny 1992; Choi, Moin & Kim 1993). A control theory approach for
laminar  flow  linear  instability  suppression  will  be  shown  to  have  several  advantages 
over  the  traditional  wave  cancellation  approach.  It  will  eliminate  the  need  to  explicitly 
measure  phase  and  frequency  of  instabilities.  Also,  it  will  provide  a  framework  to 
select  sensor  locations  in  order  to  have  the  least  complex  controller.  Further,  feedback 
control  will  be  shown  to  be  extremely  robust  to  changing  Reynolds  numbers  given 
proper  sensor  location.  In  addition,  linear  feedback  controllers  will  be  shown  to  have 
a  strong  stabilizing  effect  on  two-dimensional  finite-amplitude  disturbances. 

This  paper  is  organized  as  follows.  In  §2,  we  formulate  the  linear  channel  flow 
problem  using  the  linearized  Navier-Stokes  equations.  Boundary  input  in  the  form 
of blowing/suction and boundary output in terms of shear are represented within the
equations.  In  §3,  we  introduce  the  concepts  of  zeros  and  poles  of  a  system,  as  well  as 
control-theoretic  models.  The  governing  partial-differential  equation  for  the  system 
is  converted  into  a  set  of  first-order,  ordinary  differential  equations  via  a  Galerkin 
method.  These  first-order,  ordinary  differential  equations  are  then  converted  into  a 
control-theoretic  model.  Section  4  describes  the  infinite-dimensional  nature  of  the 
channel  flow  system  and  how  it  affects  the  selection  of  actuation.  Section  5  describes 
the  numerical  model  and  the  verification  of  the  calculated  poles  and  zeros.  Section  6 
introduces  feedback  control  design.  It  is  shown  that  judicious  sensor  placement,  based 
on  zero  locations,  can  lead  to  simple  control  schemes.  Furthermore,  the  control  system 
is  extremely  robust  to  change  in  Reynolds  number.  Section  7  explores  unobservable 
modal  reinforcement  as  a  possible  path  of  ‘bypass’  transition  in  a  controlled  system. 
It  then  shows  how  a  particular  control  model,  called  the  modal-canonical  state 
space  model,  may  be  used  to  assess  observability  of  modal  reinforcement.  Section  8 
demonstrates  multiple-wavenumber  instability  control.  Section  9  demonstrates  that  a 
linear  controller  has  a  strong  stabilizing  effect  on  two-dimensional,  finite-amplitude 
instabilities.  As  a  result,  secondary  three-dimensional  instabilities  as  described  by 
Orszag  &  Patera  (1983)  cease  to  exist.  Finally,  §10  outlines  conclusions. 


2.  Problem  formulation 


2.1.  Dynamic  equations 

We  consider  two-dimensional  plane  Poiseuille  flow  between  two  parallel  stationary 
plates.  Let  the  channel  be  of  finite  length  and  finite  height,  with  the  centreline  at 
zero.  The  flow  in  the  channel  is  described  by  the  unsteady  nonlinear  incompressible 
Navier-Stokes  equations.  In  order  to  study  the  linear  stability  of  the  system,  we 
follow the standard procedure. Consider small perturbations u(x,y,t) in the
horizontal velocity, v(x,y,t) in the vertical velocity, and p(x,y,t) in
the pressure field. Let the primary flow be represented by U(y) with U_c being the
centreline velocity. The linearized incompressible Navier-Stokes equations may be
formed  by  substituting  the  primary  flow  and  small  perturbations  into  the  nonlinear 
incompressible  Navier-Stokes  equations  and  disregarding  the  second-order  terms 
involving  the  perturbations. 


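\begin{align}
\frac{\partial u}{\partial t} + U(y)\frac{\partial u}{\partial x} + \frac{dU(y)}{dy}\,v
  &= -\frac{\partial p}{\partial x} + \frac{1}{Re}\nabla^2 u, \tag{2.1}\\
\frac{\partial v}{\partial t} + U(y)\frac{\partial v}{\partial x}
  &= -\frac{\partial p}{\partial y} + \frac{1}{Re}\nabla^2 v, \tag{2.2}\\
\frac{\partial u}{\partial x} + \frac{\partial v}{\partial y} &= 0, \tag{2.3}
\end{align}

where the flow variables are non-dimensionalized by the channel half-height, H, and
centreline velocity, U_c. Re is the Reynolds number defined as (U_c H/ν), where ν is the
kinematic viscosity. By introducing a ‘stream function’, ψ(x,y,t),

\[ u(x,y,t) \triangleq \frac{\partial\psi(x,y,t)}{\partial y} \tag{2.4} \]

and

\[ v(x,y,t) = -\frac{\partial\psi(x,y,t)}{\partial x}, \tag{2.5} \]

(2.1)-(2.3) may be combined into a single equation,

\[
\frac{\partial}{\partial t}\frac{\partial^2\psi}{\partial x^2}
+ \frac{\partial}{\partial t}\frac{\partial^2\psi}{\partial y^2}
= -U(y)\frac{\partial^3\psi}{\partial x^3}
- U(y)\frac{\partial}{\partial x}\frac{\partial^2\psi}{\partial y^2}
+ \frac{d^2U(y)}{dy^2}\frac{\partial\psi}{\partial x}
+ \frac{1}{Re}\nabla^2(\nabla^2\psi).
\tag{2.6}
\]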


Assume  periodic  boundary  conditions  in  the  streamwise  (x)  direction.  For  channel 
flow,  with  rigid  plates  at  y  =  — 1  and  y  =  1,  the  no-slip  boundary  conditions  become 


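\begin{align}
\psi(x, y=-1, t) &= 0, \tag{2.7}\\
\frac{\partial\psi}{\partial y}(x, y=-1, t) &= 0, \tag{2.8}\\
\psi(x, y=1, t) &= 0, \tag{2.9}\\
\frac{\partial\psi}{\partial y}(x, y=1, t) &= 0. \tag{2.10}
\end{align}

With an initial condition

\[ \psi(x, y, t=0) = g(x,y) \tag{2.11} \]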


the boundary value problem is completely formed. Existence and uniqueness of solutions
for the linearized Navier-Stokes equations have been studied in Ladyzhenskaya
(1969),  Kreiss  &  Lorenz  (1989),  and  Temam  (1984).  Equations  (2.6)-(2.11)  represent 
the  starting  point  for  construction  of  a  feedback  control  system.  These  equations 
neither  include  any  control  terms  nor  do  they  describe  any  sensing  of  flow  field 
variables. 
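
The step from (2.1)-(2.3) to (2.6) can be sketched as follows. This is the usual pressure-elimination argument, given here as an outline; subscripts for partial derivatives and primes for d/dy are shorthand assumed for this sketch, and the sign of the vertical velocity, v = -∂ψ/∂x, is the one consistent with (2.21).

```latex
\begin{align*}
&\partial_y(2.1) - \partial_x(2.2):\quad
(u_y - v_x)_t + U\,(u_y - v_x)_x + U'\,(u_x + v_y) + U''\,v
  = \tfrac{1}{Re}\,\nabla^2 (u_y - v_x), \\
&(2.3):\quad u_x + v_y = 0, \\
&(2.4),\,(2.5):\quad u_y - v_x = \psi_{yy} + \psi_{xx} = \nabla^2\psi,
  \qquad U''\,v = -U''\,\psi_x, \\
&\Longrightarrow\quad
\partial_t \nabla^2\psi = -U\,\partial_x \nabla^2\psi + U''\,\psi_x
  + \tfrac{1}{Re}\,\nabla^2(\nabla^2\psi).
\end{align*}
```

Expanding the Laplacian on both sides of the last line gives the terms of (2.6).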


2.2.  Boundary  input 

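We consider the case of blowing/suction at the lower wall of the channel. The boundary
conditions are now modified from before to include boundary input, represented
as the known separable function q(t)w(x)f(y).

\begin{align}
\psi(x, y=-1, t) &= q(t)\,w(x)\,f(y=-1), \tag{2.12}\\
\frac{\partial\psi}{\partial y}(x, y=-1, t) &= q(t)\,w(x)\,\frac{\partial f(y=-1)}{\partial y} = 0, \tag{2.13}\\
\psi(x, y=1, t) &= q(t)\,w(x)\,f(y=1) = 0, \tag{2.14}\\
\frac{\partial\psi}{\partial y}(x, y=1, t) &= q(t)\,w(x)\,\frac{\partial f(y=1)}{\partial y} = 0. \tag{2.15}
\end{align}

Note that these conditions constrain the function f(y) such that

\begin{align}
f(y=-1) &\neq 0, \tag{2.16}\\
\frac{\partial f(y=-1)}{\partial y} &= 0, \tag{2.17}\\
f(y=1) &= 0, \tag{2.18}\\
\frac{\partial f(y=1)}{\partial y} &= 0. \tag{2.19}
\end{align}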

Many  functions  may  be  equally  appropriate.  One  such  function  is 

(2.20) 
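
As an illustration of the constraints (2.16)-(2.19), and not necessarily the function intended in (2.20), a quartic polynomial can be checked numerically. The coefficients below are an assumption, chosen only because they satisfy the four conditions; `f` and `df` are names introduced for this sketch.

```python
def f(y):
    """Candidate wall-normal shape function (illustrative coefficients)."""
    return 0.5 * y**4 + 0.25 * y**3 - y**2 - 0.75 * y + 1.0


def df(y):
    """Derivative of the candidate f with respect to y."""
    return 2.0 * y**3 + 0.75 * y**2 - 2.0 * y - 0.75


# Evaluate the four quantities constrained by (2.16)-(2.19).
checks = {
    "f(-1) nonzero": f(-1.0),
    "df(-1) zero": df(-1.0),
    "f(+1) zero": f(1.0),
    "df(+1) zero": df(1.0),
}
```

With these coefficients the input acts only through the lower wall: f is nonzero at y = -1 and both f and its derivative vanish at y = 1.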
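In order to relate boundary conditions on ψ to blowing/suction in the wall-normal
direction, we use (2.5) to relate v(x,y,t) and ψ(x,y,t). Then (2.12) becomes

\[ v(x, y=-1, t) = -q(t)\,\frac{\partial w(x)}{\partial x}\,f(y=-1). \tag{2.21} \]

Note that v(x,y,t) is related to the derivative of w(x). The homogeneous equation
(2.6) and the inhomogeneous boundary condition (2.12) can be converted into an
inhomogeneous equation with homogeneous boundary conditions by introducing

\[ \phi(x,y,t) = \psi(x,y,t) - q(t)f(y)w(x). \tag{2.22} \]

Then by substituting into (2.6), we obtain

\begin{multline}
\frac{\partial}{\partial t}\frac{\partial^2\phi}{\partial x^2}
+ \frac{\partial}{\partial t}\frac{\partial^2\phi}{\partial y^2}
+ U(y)\frac{\partial^3\phi}{\partial x^3}
+ U(y)\frac{\partial}{\partial x}\frac{\partial^2\phi}{\partial y^2}
- \frac{d^2U(y)}{dy^2}\frac{\partial\phi}{\partial x}
- \frac{1}{Re}\frac{\partial^4\phi}{\partial x^4}
- \frac{2}{Re}\frac{\partial^4\phi}{\partial x^2\partial y^2}
- \frac{1}{Re}\frac{\partial^4\phi}{\partial y^4} \\
= -\frac{\partial q(t)}{\partial t}\frac{\partial^2 w(x)}{\partial x^2}f(y)
- \frac{\partial q(t)}{\partial t}w(x)\frac{\partial^2 f(y)}{\partial y^2}
- q(t)\frac{\partial^3 w(x)}{\partial x^3}U(y)f(y)
- q(t)\frac{\partial w(x)}{\partial x}U(y)\frac{\partial^2 f(y)}{\partial y^2}
+ q(t)\frac{\partial w(x)}{\partial x}\frac{d^2U(y)}{dy^2}f(y) \\
+ \frac{1}{Re}q(t)\frac{\partial^4 w(x)}{\partial x^4}f(y)
+ \frac{2}{Re}q(t)\frac{\partial^2 w(x)}{\partial x^2}\frac{\partial^2 f(y)}{\partial y^2}
+ \frac{1}{Re}q(t)w(x)\frac{\partial^4 f(y)}{\partial y^4}.
\tag{2.23}
\end{multline}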

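The boundary conditions in terms of φ are now

\begin{align}
\phi(y=-1) &= 0, \tag{2.24}\\
\frac{\partial\phi(y=-1)}{\partial y} &= 0, \tag{2.25}\\
\phi(y=1) &= 0, \tag{2.26}\\
\frac{\partial\phi(y=1)}{\partial y} &= 0. \tag{2.27}
\end{align}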

2.3.  Boundary  output 

We use the streamwise component of shear at a single boundary point, z(x_i, y = -1, t),
as our boundary output, which is given by

\[ z(x_i, y=-1, t) = \frac{\partial u(x_i, y=-1, t)}{\partial y}. \tag{2.28} \]

By expressing u(x_i, y = -1, t) in terms of the stream function (2.4),

\[ z(x_i, y=-1, t) = \frac{\partial^2\psi(x_i, y=-1, t)}{\partial y^2}, \tag{2.29} \]

and by observing (2.22)

\[ z(x_i, y=-1, t) = \frac{\partial^2\phi(x_i, y=-1, t)}{\partial y^2}
   + q(t)\,\frac{\partial^2 f(y=-1)}{\partial y^2}\,w(x_i). \tag{2.30} \]


3.  Zeros,  eigenvalues,  and  control-theoretic  models 
Linear  stability  analysis  of  (2.6)  (Drazin  &  Reid  1981)  shows  that  the  system  is 
linearly  unstable  for  a  range  of  Reynolds  numbers.  The  goal  of  this  paper  is  to 
stabilize the system using control theory. To do this, we first transform the governing
equations  into  special  control-theoretic  models. 


3.1.  System  transfer  function 

We have defined a single-input/single-output (SISO) system in the sense that only
one scalar function, q(t), defines the entire input and one scalar function, z(t), defines
the output. A common form of control model for a finite-dimensional SISO system
is the transfer function model. The transfer function, H(s), is defined as the Laplace
transform of the output, z(t), divided by the Laplace transform of the input, q(t),
where zero initial conditions are assumed. Then

\[ H(s) \triangleq \frac{\mathcal{L}[z(t)]}{\mathcal{L}[q(t)]} = \frac{Z(s)}{Q(s)}. \tag{3.1} \]

For finite-dimensional systems, Z(s) and Q(s) take the form of polynomials in the
complex variable s. These polynomials may be factored to yield an equivalent
representation.

\[ H(s) = \frac{\prod_{i=1}^{j}(s - \zeta_i)}{\prod_{i=1}^{l}(s - p_i)}. \tag{3.2} \]

In this form, p_1 ... p_l are the poles of the system. The poles of any system are dependent
solely on the physics of the underlying system, independent of any particular input or
output. Unstable modes of the system appear as poles whose real part is greater than
zero. As will be seen in later sections, the poles are closely related to the eigenvalues
of the Orr-Sommerfeld equation. The values ζ_1 ... ζ_j are the zeros of the system. They
are heavily dependent on which particular inputs and outputs are used on the system.
As will be seen later, the position of these zeros will dictate sensor locations and will
reveal the effect of feedback control on the eigenvalues. A graphical representation of
the transfer function can be produced by plotting the poles and zeros in the complex
s-space.
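
A factored transfer function of this kind is easy to evaluate directly from its pole and zero lists. The sketch below is illustrative only: the pole and zero values are hypothetical, and the optional `gain` argument is an assumption added for generality rather than part of the representation above.

```python
def transfer_function(zeros, poles, gain=1.0):
    """Return H(s) = gain * prod(s - zero) / prod(s - pole) as a callable."""
    def H(s):
        num = gain
        for z in zeros:
            num *= (s - z)
        den = 1.0
        for p in poles:
            den *= (s - p)
        return num / den
    return H


def unstable_poles(poles):
    """Poles with positive real part, i.e. linearly unstable modes."""
    return [p for p in poles if complex(p).real > 0]


# Hypothetical system: one unstable complex pair, one stable real pole, one zero.
poles = [0.02 + 0.3j, 0.02 - 0.3j, -1.0]
zeros = [-0.5]

H = transfer_function(zeros, poles)
dc_value = H(0.0)               # response at s = 0
at_zero = H(-0.5)               # vanishes at the system zero
n_unstable = len(unstable_poles(poles))
```

Plotting `poles` and `zeros` in the complex plane gives the graphical representation described in the text.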


3.2.  State-variable  model 

Much  of  modern  control  theory  is  based  on  the  state-variable  representation  of  a 
dynamic  system.  This  representation  relies  on  the  basic  fact  that  the  motion  of  any 
finite-dimensional  dynamic  system  may  be  expressed  as  a  set  of  first-order  ordinary 
differential  equations.  As  a  simple  example  of  a  state  variable  model  (Franklin,  Powell 
&  Emami-Naeini  1988),  Newton’s  law  for  a  constant  single  mass,  M,  moving  in  one 
dimension, x, under a force, F(t), is

\[ M\,\frac{d^2x(t)}{dt^2} = F(t). \tag{3.3} \]

If we define one state variable as the position x_1 = x(t) and the other state variable
as the velocity x_2 = dx(t)/dt, (3.3) can be written as

\begin{align}
\frac{dx_1}{dt} &= x_2, \tag{3.4}\\
\frac{dx_2}{dt} &= \frac{F(t)}{M}. \tag{3.5}
\end{align}

Systems  theory  approach  to  feedback  stabilization  of  plane  Poiseuille  flow  163 


Furthermore, the first-order linear ordinary differential equations can be expressed
using matrix notation

$$\begin{bmatrix} dx_1/dt \\ dx_2/dt \end{bmatrix} = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix} \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} + \begin{bmatrix} 0 \\ 1/M \end{bmatrix} F(t) \qquad (3.6)$$

or, with $q$ denoting the input (here $F(t)$),

$$\frac{dx}{dt} = Ax + Bq. \qquad (3.7)$$

If we take output as position, $x_1$,

$$z = \begin{bmatrix} 1 & 0 \end{bmatrix} \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} \qquad (3.8)$$

or

$$z = Cx. \qquad (3.9)$$

The matrix A, and the vectors B and C are called the state space matrices of the
single-input, single-output (SISO) system. More specifically, the A matrix is referred
to as the dynamic matrix of the system. It can be shown that the poles of the system
are simply the eigenvalues of the A matrix. In the more general case of multiple-input,
multiple-output (MIMO) systems, B and C are matrices. For generality, we may add
another term, Dq, to the output equation to account for systems in which there is
direct feedthrough from the input to the output. Then,

$$z = Cx + Dq. \qquad (3.10)$$


In the case of no direct feedthrough, D = 0, the state space and transfer function
models are related as

$$H(s) = C\,(sI - A)^{-1}B. \qquad (3.11)$$
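The single-mass example can be checked numerically. The sketch below assumes a hypothetical mass value and a hypothetical test point in the s-plane; it forms the A, B and C matrices of the example, confirms that the poles are the eigenvalues of A, and compares $C(sI-A)^{-1}B$ with the transfer function $1/(Ms^2)$ obtained directly from Newton's law. The function name `transfer_function` is introduced here for illustration.

```python
import numpy as np

M = 2.0  # hypothetical mass, illustrative value only

# State-space matrices of the single-mass example (position and velocity states).
A = np.array([[0.0, 1.0],
              [0.0, 0.0]])
B = np.array([[0.0],
              [1.0 / M]])
C = np.array([[1.0, 0.0]])

# Poles of the system are the eigenvalues of A: a double pole at the origin.
poles = np.linalg.eigvals(A)


def transfer_function(s, A, B, C):
    """Evaluate H(s) = C (sI - A)^{-1} B at one complex frequency s (D = 0)."""
    n = A.shape[0]
    return (C @ np.linalg.solve(s * np.eye(n) - A, B))[0, 0]


s_test = 0.5 + 2.0j                      # arbitrary test point
H_state_space = transfer_function(s_test, A, B, C)
H_direct = 1.0 / (M * s_test**2)         # from M s^2 X(s) = F(s)
print(poles, H_state_space, H_direct)
```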


3.3.  State-space  formulation  for  channel  system 

We will convert our problem into a set of first-order ordinary differential equations
and then form a state-space model from these equations. The state-space model can
then be represented with transfer function poles and zeros. We proceed in this
way for two reasons. First, our system lends itself to decomposition into first-order
ordinary differential equations by use of a Galerkin method. More importantly,
however, the state-space model lends itself to elegant, well-prescribed ways of
controlling the eigenvalues of a system. It should be noted that, unlike the
single-mass example given above, the channel system requires an infinite number of
ordinary differential equations to describe its motion. This is known as an
infinite-dimensional system. As a result, any finite number of ordinary differential
equations used in a state-space model will not completely describe the system. The
difficulties associated with such a system are taken up after a discussion of the
Galerkin method used to obtain the ordinary differential equations.


3.3.1.  First-order  system 

We use a standard Galerkin procedure to convert the governing partial differential
equation (2.23) into a system of ordinary differential equations. Assume an
approximate solution of (2.23) as

$$\phi_a(x,y,t) = \sum_{n=-N}^{N} \sum_{m=0}^{M} a_{nm}(t)\,P_n(x)\,\Gamma_m(y). \qquad (3.12)$$


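By using Fourier functions for $P_n(x)$ and basis functions constructed from
Chebyshev polynomials for $\Gamma_m(y)$ (Joshi 1996), we obtain a first-order system. Define
inner products in the streamwise (x) and normal (y) directions respectively:

$$[P_n(x), P_m(x)]_x \triangleq \frac{1}{L}\int_{-L/2}^{L/2} e^{in\alpha_0 x}\,e^{-im\alpha_0 x}\,dx = \delta_{nm}, \qquad \alpha_0 \triangleq \frac{2\pi}{L}, \qquad (3.13)$$

and

$$[\Gamma_m(y), \Gamma_n(y)]_y \triangleq \int_{-1}^{1} \Gamma_m(y)\,\Gamma_n(y)\,dy, \qquad (3.14)$$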


where  L  is  the  non-dimensional  length  of  the  finite-length  channel.  Applying  the 
orthogonality  of  the  basis  functions  in  x,  we  obtain  a  system  of  first-order  ordinary 
differential  equations: 


E  + E  =  E  »,.(«)«>«  -  E 


2+ 

mfe 


m=0 

M 


m=0 
M 


m=0 


m=0 


..  iw  ^  M 

m— 0  m=0  m=0 

+  ^  E  «"«(t))8^  +  ^{5,1}  +  g(t){S,i},  l  =  -N...N,k  =  0...M, 

fn^=0 


where 


cl  A 
A(/i  — 


and 

—  — 
^Ik  — 


d^w{x) 


dx^ 

rd^w(x) 


,P((x) 


+ 

+2 


dx^ 

5w(x) 


,P/(x) 


dx 


,Pi(x) 


,  1 

’5^w(x) 

Re 

dx^ 

Jx  L  dy2 


[l’(y),  A(3')]y  -  [w(x),P,(^)] 

’dw(x) 

1 

v(y),rk(y) 


sMy) 


[  Sy^ 


,rkiy) 


(3.15) 


(3.16) 


Jy 


[C/(y)a(y),A(y)] 

■dWiy) 


dx 


,Piix) 

5‘'iv(x) 


Uiy) 


dMy) 

dy^ 


,n(y) 


5x4 


,Piix) 


[i^(y).A(y)]v 


dy^ 


.A(y) 


1 


Jy 


+  ^  [W(X),P;(X)], 


d^viy) 


L  ^y* 


,rk(y) 


.(3.17) 


jy 


In this system, $\alpha_0$ is the fundamental wavenumber in the x-direction, defined as $2\pi/L$,
and the P coefficients are defined in terms of the following scalar products, where the
$\Gamma(y)$ are the basis functions in y:


pL  = 


dyj 


;  =  0...4, 


Jy 


C  =  [U{y)rAy),rkiy)]y, 


»0-  A 
mk 


d^ujy) 

[ 


rmiylPkiy) 


(3.18) 

(3.19) 

(3.20) 


Jy 
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d^Tmly) 

uiy)-^,rdy) 


dy^ 


(3.21) 


■iy 


3.3.2.  Transformation  to  state-space  form 
We  may  visualize  (3.15)  in  block  matrix  form  as 

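$$\begin{bmatrix} M_{-N} & 0 & \cdots & 0 \\ 0 & M_{-N+1} & \cdots & 0 \\ \vdots & & \ddots & \vdots \\ 0 & 0 & \cdots & M_{N} \end{bmatrix} \begin{bmatrix} da_{l=-N}/dt \\ da_{l=-N+1}/dt \\ \vdots \\ da_{l=N}/dt \end{bmatrix} = \begin{bmatrix} K_{-N} & 0 & \cdots & 0 \\ 0 & K_{-N+1} & \cdots & 0 \\ \vdots & & \ddots & \vdots \\ 0 & 0 & \cdots & K_{N} \end{bmatrix} \begin{bmatrix} a_{l=-N} \\ a_{l=-N+1} \\ \vdots \\ a_{l=N} \end{bmatrix} + \begin{bmatrix} \Upsilon_1 & \Upsilon_2 \end{bmatrix} \begin{bmatrix} q(t) \\ dq(t)/dt \end{bmatrix} \qquad (3.22)$$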


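where $a_{l=p}$ denotes a column vector of all $a_{lk}$ coefficients whose first index, $l$, is the
value $p$. In compact notation,

$$M\,\frac{da}{dt} = Ka + \begin{bmatrix} \Upsilon_1 & \Upsilon_2 \end{bmatrix} \begin{bmatrix} q(t) \\ dq(t)/dt \end{bmatrix}. \qquad (3.23)$$

Assuming non-singularity of the M matrix, we may invert to obtain

$$\frac{da}{dt} = M^{-1}Ka + \begin{bmatrix} M^{-1}\Upsilon_1 & M^{-1}\Upsilon_2 \end{bmatrix} \begin{bmatrix} q(t) \\ dq(t)/dt \end{bmatrix} \qquad (3.24)$$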

where $a$, $K$, $\Upsilon_1$ and $\Upsilon_2$ are all complex. By expanding in terms of real and imaginary
parts, we obtain

$$\frac{da_R}{dt} + i\frac{da_I}{dt} = M^{-1}K_R a_R + iM^{-1}K_I a_R + iM^{-1}K_R a_I - M^{-1}K_I a_I + M^{-1}\Upsilon_{1R}\,q(t) + iM^{-1}\Upsilon_{1I}\,q(t) + M^{-1}\Upsilon_{2R}\frac{dq(t)}{dt} + iM^{-1}\Upsilon_{2I}\frac{dq(t)}{dt} \qquad (3.25)$$

where the subscript R or I indicates real or imaginary parts. Define

$$p \triangleq \begin{bmatrix} a_R \\ a_I \\ q(t) \end{bmatrix}. \qquad (3.26)$$

Then the state space system is

$$\frac{dp}{dt} = \begin{bmatrix} M^{-1}K_R & -M^{-1}K_I & M^{-1}\Upsilon_{1R} \\ M^{-1}K_I & M^{-1}K_R & M^{-1}\Upsilon_{1I} \\ 0 & 0 & 0 \end{bmatrix} p + \begin{bmatrix} M^{-1}\Upsilon_{2R} \\ M^{-1}\Upsilon_{2I} \\ 1 \end{bmatrix} \frac{dq(t)}{dt}. \qquad (3.27)$$

We write (3.27) as

$$\frac{dp}{dt} = Ap + Bu, \qquad (3.28)$$

where the input to the system is $u(t) = dq(t)/dt$. Note that since $q(t)$ in (2.12) is related to
physical blowing/suction, it is important to keep in mind that the input to this model
is the derivative of $q(t)$. In physical terms, any input derived from this model will
be in terms of $dq(t)/dt$ and therefore must be integrated before it can be applied as
physical blowing/suction.
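The stacking of real and imaginary parts can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' code: the arrays `K`, `Y1` and `Y2` are random stand-ins for the complex quantities already premultiplied by the inverse of M, and the function name `stack_real_state_space` is hypothetical. The check at the end confirms that the real system reproduces the complex rate of change and that the extra state integrates the input.

```python
import numpy as np


def stack_real_state_space(K, Y1, Y2):
    """Build real A and B for p = [Re(a), Im(a), q] with input u = dq/dt.

    Models da/dt = K a + Y1 q + Y2 u, where K (n x n) and Y1, Y2 (length n)
    are complex and already premultiplied by the inverse of M.
    """
    n = K.shape[0]
    A = np.zeros((2 * n + 1, 2 * n + 1))
    A[:n, :n] = K.real
    A[:n, n:2 * n] = -K.imag
    A[n:2 * n, :n] = K.imag
    A[n:2 * n, n:2 * n] = K.real
    A[:n, 2 * n] = Y1.real
    A[n:2 * n, 2 * n] = Y1.imag
    # Last row of A stays zero: dq/dt is supplied entirely by the input.
    B = np.concatenate([Y2.real, Y2.imag, [1.0]])
    return A, B


# Hypothetical complex data, used for a consistency check only.
rng = np.random.default_rng(0)
n = 3
K = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
Y1 = rng.normal(size=n) + 1j * rng.normal(size=n)
Y2 = rng.normal(size=n) + 1j * rng.normal(size=n)
a = rng.normal(size=n) + 1j * rng.normal(size=n)
q, u = 0.7, -0.3

A, B = stack_real_state_space(K, Y1, Y2)
p = np.concatenate([a.real, a.imag, [q]])

complex_rate = K @ a + Y1 * q + Y2 * u   # da/dt from the complex form
real_rate = A @ p + B * u                # dp/dt from the stacked real form
```

The zero last row of `A` is what produces the pole at the origin associated with the built-in integrator.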


3.3.3.  Measurement  equations 

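By observing (2.30) and using the assumed solution form (3.12),

$$z(x_i, y=-1, t) = \sum_{n=-N}^{N} \sum_{m=0}^{M} a_{nm}(t)\,P_n(x_i)\,\frac{\partial^2 \Gamma_m(y=-1)}{\partial y^2} + q(t)\,\frac{\partial^2 f(y=-1)}{\partial y^2}\,w(x_i) + p(t, x_i, y=-1) \qquad (3.29)$$

where $p(t, x_i, y=-1)$, the residual component of streamwise shear due to the truncated
expansion, is assumed to be negligible compared to the two other terms. The second
term on the right-hand side of (3.29) is made up of the known input terms (2.12).
Denote

$$D \triangleq \frac{\partial^2 f(y=-1)}{\partial y^2}\,w(x_i). \qquad (3.30)$$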


By pulling out the complex time coefficients and denoting them as the vector $a$ as
above, we may construct a complex observation matrix, $O$:

$$z(x_i, y=-1, t) = Oa + Dq(t). \qquad (3.31)$$

Finally, by creating $p$ by stacking the real and imaginary parts of $a$, as well as $q(t)$,
we may construct an observation equation in state-variable form

$$\begin{bmatrix} z_R(x_i, y=-1, t) \\ z_I(x_i, y=-1, t) \end{bmatrix} = \begin{bmatrix} O_R & -O_I & D \\ O_I & O_R & 0 \end{bmatrix} p. \qquad (3.32)$$

Since we may measure only real output (shear), we are left with only the top half of
the observation matrix:

$$z_R(x_i, y=-1, t) = \begin{bmatrix} O_R & -O_I & D \end{bmatrix} p. \qquad (3.33)$$

In order to describe the system in traditional control terms, define

$$C \triangleq \begin{bmatrix} O_R & -O_I & D \end{bmatrix}. \qquad (3.34)$$

Then,

$$z_R(x_i, y=-1, t) = Cp. \qquad (3.35)$$

3.3.4.  Initial  conditions 

We  have  not  accounted  for  the  initial  value  (2.11)  in  our  boundary  value  problem. 
We  may  account  for  the  initial  condition  by  assuming  it  can  be  written  as  a  series 
expansion in terms of $P_n(x)$ and $\Gamma_m(y)$. Then

$$\phi(x,y,t=0) = \psi(x,y,t=0) - q(t=0)f(y)w(x) = g(x,y) - q(t=0)f(y)w(x) = \sum_{n=-N}^{N} \sum_{m=0}^{M} b_{nm}\,P_n(x)\,\Gamma_m(y), \qquad (3.36)$$

where $b_{nm}$ are assumed known. Then

$$a(t=0) = b. \qquad (3.37)$$

Since $a$ and $b$ are both complex, stack the real and imaginary components to arrive
at an initial condition on $p$:

$$p(t=0) = \begin{bmatrix} b_R \\ b_I \\ q(t=0) \end{bmatrix}. \qquad (3.38)$$


We  have  now  defined  the  system  completely  in  the  state-variable  form  of  (3.7),  (3.9) 
where  A  and  B  are  given  in  (3.28),  C  is  given  in  (3.35),  and  the  initial  condition  is 
(3.38).  Although  we  have  shown  the  state-space  formulation  using  a  single  input  and  a 
single  output,  we  may  formulate  the  multiple  input/multiple  output  case  similarly.  In 
that  case,  the  multiple  input  functions  are  represented  as  a  sum  of  separable  functions 
in  (2.12)  and  multiple  outputs  are  represented  as  a  vector  whose  components  are  of 
the  form  (3.35).  It  should  be  noted  that  in  the  multiple  input  case,  it  is  important  for 
physical  implementation  that  for  any  finite  stretch  in  the  x-direction,  only  one  of  the 
multiple  input  functions  is  non-zero. 


4. Infinite-dimensional nature of the channel problem and effect on
actuator design

In  a  system  described  by  partial-differential  equations,  such  as  the  channel  flow 
problem,  no  finite-dimensional  model  will  capture  all  the  dynamics  of  the  system. 
Such  a  system  is  called  infinite-dimensional.  By  examining  the  Galerkin  approximate 
solution (3.12), we see that the approximate solution can converge only as $N, M \to \infty$.
This would result in an infinite number of ordinary differential equations. Each n
value included in the approximate solution is referred to as adding an additional
wavenumber in the dynamic model. Even for just one wavenumber, n = N (say),
we see the assumed Galerkin solution requires an infinite number of terms in the
y-direction,

$$\phi_a(x,y,t) = \sum_{m=0}^{M \to \infty} a_{Nm}(t)\,P_N(x)\,\Gamma_m(y). \qquad (4.1)$$

Interestingly,  by  examining  the  structure  of  the  A  matrix  in  the  resulting  state-space 
model,  we  see  that  the  dynamics  associated  with  each  individual  wavenumber  are 
decoupled  from  the  others.  This  is  characterized  by  a  block  diagonal  form  of  the  A 
matrix,  (4.2).  As  a  result,  we  may  separate  the  problem  of  determining  system  poles 
into  a  set  of  smaller  problems  that  include  only  one  wavenumber  at  a  time: 


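$$A = \begin{bmatrix} [A_{(n=1)}] & 0 & \cdots & 0 \\ 0 & [A_{(n=2)}] & \cdots & 0 \\ \vdots & & \ddots & \vdots \\ 0 & 0 & \cdots & [A_{(n=\infty)}] \end{bmatrix}. \qquad (4.2)$$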


Indeed,  the  eigenvalues  (poles)  of  the  entire  A  matrix  are  simply  the  eigenvalues 
(poles)  of  each  smaller  A  matrix  block.  Offsetting  this  computational  advantage, 
however,  is  the  fact  that  the  locations  of  the  zeros  are  not  decoupled  by  wavenumber. 
Therefore,  even  though  the  poles  of  the  system  can  be  calculated  separately  using 
only one wavenumber in a particular computation, the zeros of the system force all
the wavenumbers in the model to be considered together. This can easily be seen
from an examination of the transfer functions associated with just two wavenumbers.
Consider two transfer functions obtained by using two distinct wavenumbers and the
same input, u(t), and output, z(t):

$$H_{(n=1)} = \frac{N_{(n=1)}(s)}{D_{(n=1)}(s)}, \qquad H_{(n=2)} = \frac{N_{(n=2)}(s)}{D_{(n=2)}(s)}. \qquad (4.3)$$

The roots of N(s) are the zeros and the roots of D(s) are the poles. In terms of input
and output,

$$z = H_{(n=1)}\,u, \qquad z = H_{(n=2)}\,u. \qquad (4.4)$$

Considered together, however,

$$z = \left[H_{(n=1)} + H_{(n=2)}\right] u = \frac{D_{(n=2)}(s)N_{(n=1)}(s) + D_{(n=1)}(s)N_{(n=2)}(s)}{D_{(n=1)}(s)\,D_{(n=2)}(s)}\,u. \qquad (4.5)$$

The poles of the new, combined transfer function are clearly those of each model
separately. The zeros, however, are the roots of a completely new polynomial
$D_{(n=2)}(s)N_{(n=1)}(s) + D_{(n=1)}(s)N_{(n=2)}(s)$, which, in general, has little to do with either of
the original numerators.
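The contrast between poles and zeros in (4.5) can be reproduced with polynomial arithmetic. The sketch below uses two hypothetical transfer functions whose coefficients are illustrative only; it forms the combined numerator and denominator and compares their roots with those of the separate models.

```python
import numpy as np

# Two hypothetical single-wavenumber transfer functions (illustrative values):
# H1 = (s + 1) / (s^2 + 2 s + 5),  H2 = (s + 4) / (s^2 + 0.4 s + 4.04)
N1, D1 = np.array([1.0, 1.0]), np.array([1.0, 2.0, 5.0])
N2, D2 = np.array([1.0, 4.0]), np.array([1.0, 0.4, 4.04])

# Combined model H1 + H2 = (D2 N1 + D1 N2) / (D1 D2), as in (4.5).
combined_num = np.polyadd(np.polymul(D2, N1), np.polymul(D1, N2))
combined_den = np.polymul(D1, D2)

combined_poles = np.roots(combined_den)
combined_zeros = np.roots(combined_num)
separate_poles = np.concatenate([np.roots(D1), np.roots(D2)])
separate_zeros = np.concatenate([np.roots(N1), np.roots(N2)])

print("combined poles:", combined_poles)   # same set as separate_poles
print("separate zeros:", separate_zeros)   # -1 and -4
print("combined zeros:", combined_zeros)   # roots of a new polynomial
```

The combined poles coincide with the union of the separate poles, whereas neither of the original zeros is a root of the combined numerator.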

We  have  seen  that  an  infinite  expansion  is  needed  to  fully  model  the  channel  system. 
Since  in  practice  an  infinite  expansion  cannot  be  used  to  create  a  state-space  model,  we 
will  always  have  some  part  of  the  system  dynamics  that  is  unmodelled.  Furthermore, 
we  have  seen  that  the  A  (dynamic)  matrix  may  be  decoupled  by  wavenumber. 

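Consider a partition of the state-space model as

$$\frac{d}{dt}\begin{bmatrix} x_m \\ x_u \end{bmatrix} = \begin{bmatrix} A_m & 0 \\ 0 & A_u \end{bmatrix} \begin{bmatrix} x_m \\ x_u \end{bmatrix} + \begin{bmatrix} B_m \\ B_u \end{bmatrix} u(t), \qquad (4.6)$$

$$z = \begin{bmatrix} C_m & C_u \end{bmatrix} x, \qquad (4.7)$$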

where the subscripts m and u represent the modelled and unmodelled parts of the
system. In the channel flow problem, the unmodelled part is meant to denote only
the dynamics of wavenumbers left out of the finite-dimensional model. Note that
both $A_u$ and $A_m$ are of infinite dimension: $A_u$ because of the infinite number of
wavenumbers left out of the reduced-order model and $A_m$ because of the infinite
number of expansion functions needed in y for each of the finite number of modelled
wavenumbers. One way to avoid considering unmodelled wavenumber dynamics is
to ensure that the control input, u(t), has no effect whatsoever on the unmodelled
wavenumber dynamics. In terms of the state-space model (4.6), this is equivalent to
rendering $B_u = 0$. This is known as making the unmodelled wavenumber dynamics
uncontrollable. By examining (3.27), we see that the B matrix is formed from terms
of $S^1_{lk}$ in (3.16). If w(x) in (3.16) is such that $S^1_{lk} = 0$, then those components of the
B matrix are zero. Equivalently, we must ensure that the projection of w(x) onto the
unmodelled wavenumbers is zero. Owing to the orthogonality of Fourier components,
we select w(x) to be made up of modelled Fourier components only.
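The orthogonality argument can be illustrated numerically. The sketch below assumes a channel of length $L = 4\pi$ (so that $\alpha_0 = 0.5$) and the weighting $w(x) = \sin(x)$, and it assumes the projection is taken as $(1/L)\int w(x)\,e^{-in\alpha_0 x}\,dx$; the function name `fourier_projection` and the grid size are choices made here for illustration.

```python
import numpy as np

L = 4.0 * np.pi              # non-dimensional channel length (assumed)
alpha0 = 2.0 * np.pi / L     # fundamental wavenumber, 0.5 for this L


def fourier_projection(w, n, num_points=4096):
    """Approximate (1/L) * integral of w(x) exp(-i n alpha0 x) over one period."""
    x = -L / 2 + L * np.arange(num_points) / num_points   # uniform periodic grid
    return np.mean(w(x) * np.exp(-1j * n * alpha0 * x))


# Projection of w(x) = sin(x) onto wavenumbers n = -4 ... 4.
coeffs = {n: fourier_projection(np.sin, n) for n in range(-4, 5)}

for n, value in coeffs.items():
    print(n, np.round(value, 12))
```

Only the entries for n = 2 and n = -2 (wavenumber $\pm 2\alpha_0 = \pm 1$) are non-zero, so an input with this weighting leaves every other wavenumber unforced.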


5.  Single-wavenumber  channel  model 

Consider the channel model shown in figure 1. The total non-dimensional length of
the channel, L, is $4\pi$. In this case, the fundamental wavenumber is $\alpha_0 = 0.5$. Only one
wavenumber is included in the model, corresponding to $\alpha = 2\alpha_0 = 1.00$. The Reynolds
number is chosen as Re = 10000. Input is distributed along the bottom plate
with a sinusoidal weighting function in order to render the unmodelled wavenumber
dynamics uncontrollable as described in §4. In terms of the input function, w(x), see
(2.12),

$$w(x) = \sin(x). \qquad (5.1)$$

Note that the physical blowing/suction, v(x,y,t), takes the form

$$v(x, y=-1, t) = -q(t)\,\frac{\partial w(x)}{\partial x}\,f(y=-1) = -q(t)\cos(x)\,f(y=-1), \qquad (5.2)$$

see (2.21). This type of input may be achieved in practice by a large number of
independently controllable actuators that are distributed along the lower channel
wall. The f(y) function is chosen as in (2.20). The sensor location is $x = +\pi/2$. In
order to visualize the control-theoretic model, the system A, B, and C matrices are
transformed to transfer function form, H(s). Figure 2 shows the locations of the poles
and zeros in the s-plane for the single-wavenumber model.

Figure 1. System model for Poiseuille flow. Input is applied along the bottom plate as
w(x) = sin(x) and shear is measured at $x = \pi/2$ with Re = 10000 and $\alpha = 1.0$. (Diagram labels:
channel length $L = 4\pi$; wall input $-dw(x)/dx = -\cos(x)$; shear sensor at $x = +\pi/2$.)


5.1.  Relation  of  poles  to  eigenvalues  of  the  Orr-Sommerfeld  equation 

Poles of the transfer function or, equivalently, eigenvalues of the A matrix are closely
related to the eigenvalues of the Orr-Sommerfeld equation. Assume a solution of
(2.6) of the form

$$\psi(x,y,t) = P(y)\,e^{i\alpha x}\,e^{-i\alpha c t}. \qquad (5.3)$$

By substituting into (2.6), we obtain the familiar Orr-Sommerfeld equation in the
normal-mode form,

$$[U(y) - c]\left[\frac{d^2P(y)}{dy^2} - \alpha^2 P(y)\right] - \frac{d^2U(y)}{dy^2}\,P(y) = \frac{1}{i\alpha Re}\left[\frac{d^4P(y)}{dy^4} - 2\alpha^2\frac{d^2P(y)}{dy^2} + \alpha^4 P(y)\right]. \qquad (5.4)$$


Here, the Reynolds number, Re, is known; the wavenumber, $\alpha$, is assumed real and
known; the complex wave speed, c, is the eigenvalue of the problem; and the function
P(y) is the eigenvector of the problem. Stability of a flow for a given value of Re
and $\alpha$ is determined by the imaginary part of c. If the imaginary part is positive, the
solution (5.3) becomes an unbounded exponential and the flow is unstable. Poles of the
transfer function in the Laplace domain at $s = p_i$ correspond to solutions in the time
domain of $e^{p_i t}$. As a result, we observe that poles of the transfer function are related
to eigenvalues of the Orr-Sommerfeld equation as

$$p_i = -i\alpha c. \qquad (5.5)$$

Figure 2. Pole (x) and zero (o) configuration for the system model of figure 1. Note that all poles
and zeros appear in complex-conjugate pairs. Each pair of complex-conjugate poles represents one
mode of the system. One pair of poles is to the right of the imaginary axis. This indicates the system
has one unstable mode. Note that the single pole at the origin represents a built-in integrator due
to $u(t) = dq(t)/dt$. Channel model: Re = 10000, shear sensor at $\pi/2$, w(x) = sin(x), $L = 4\pi$, $\alpha = 1.0$.

In  order  to  validate  our  linear  code,  we  compare  our  poles  to  eigenvalues  produced  in 
Orszag  (1971).  Orszag  (1971)  obtained  Orr-Sommerfeld  eigenvalues  for  the  channel 
problem  with  a  =  1.00  and  Re  =  10000.  He  reported  only  one  slightly  unstable 
eigenvalue at $s = 0.00373967 + i0.23752649$ ($c = 0.23752649 - i0.00373967$). The
eigenvalue is seen as unstable by its positive real part. We obtained identical results
including the one unstable mode at

$$s = 0.00373967 \pm i0.23752649.$$

All  other  stable  modes  obtained  in  the  present  study  are  identical  to  those  reported 
in  Orszag  (1971).  The  goal  of  our  control  system  will  be  to  move  these  unstable  poles 
into  the  stable  half  of  the  s-plane  or,  equivalently,  make  sure  the  controlled-system 
poles  all  have  real  parts  less  than  zero. 
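The mapping (5.5) between a wave speed and a pole is a one-line computation. The sketch below is illustrative: the function name `pole_from_wave_speed` is introduced here, and the wave speed is entered with a positive imaginary part, which is the sign the instability criterion for (5.3) requires (an assumption about the sign convention). Because the poles of the real-valued state-space model occur in complex-conjugate pairs, the conjugate of the computed value is a pole as well.

```python
def pole_from_wave_speed(alpha, c):
    """Map an Orr-Sommerfeld wave speed c to an s-plane pole via p = -i * alpha * c."""
    return -1j * alpha * c


alpha = 1.0                          # wavenumber of the single-wavenumber model
c = 0.23752649 + 0.00373967j         # wave speed with positive imaginary part (assumed sign)
p = pole_from_wave_speed(alpha, c)

# A positive imaginary part of c gives a pole with positive real part.
print(p, "unstable" if p.real > 0 else "stable")
```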

5.2.  Verification  of  model  zeros 

Verification of the system zeros is more difficult because, to our knowledge, no
published results exist. We use the channel code simulation used in Kim,
Moin & Moser (1987) to verify our zeros. This code is a spectral channel flow code
which uses periodic boundary conditions. Consider the transfer function model (3.2)
with J = 1 and I = 1:

$$H(s) = \frac{Z(s)}{U(s)} = \frac{(s - \zeta_1)}{(s - p_1)}. \qquad (5.6)$$

In the Laplace domain,

$$sZ(s) - p_1 Z(s) = sU(s) - \zeta_1 U(s). \qquad (5.7)$$

Assuming zero initial conditions and transforming into the time domain, (5.7) becomes

$$\frac{dz(t)}{dt} - p_1 z(t) = \frac{du(t)}{dt} - \zeta_1 u(t), \qquad (5.8)$$

where z(t) is the output function and u(t) is the input function. If the input is taken
as

$$u(t) = e^{\zeta_1 t}, \qquad (5.9)$$

then the right-hand side of (5.8) becomes zero and the system behaves as if zero
input has been applied. As a result, z(t) remains zero for all time. Therefore, we may
verify zeros of the system by applying non-zero input in special ways and observing
zero output. Note from figure 2 that for sensor location $x = +\pi/2$, one zero exists
in the right half plane at $s = +0.317557 + 0i$. This represents an ideal zero to check
as this corresponds to an unbounded, unstable input. Figure 3 shows output from
the Navier-Stokes simulation with input $u(t) = dq(t)/dt = e^{0.317557t}$. Although an
unbounded, exponentially growing input is applied to the system, near
zero output is observed at the sensor, thus verifying that $\zeta_1$ is indeed a zero of the
system. This is contrasted with measurements at different sensor locations that grow
rapidly. With a sensor at a different location, the zeros of the system change position
so zero output is no longer expected for this particular input. Absolute zero is not
observed at sensor location $x = \pi/2$ due to slight numerical inaccuracies in the model.

Figure 3. Output from the Navier-Stokes simulation with input applied as $u(t) = e^{0.317557t}$.
Although an unbounded input is applied, near zero output is observed at sensor location $\pi/2$ (solid
curve). Sensor location 0 (dashed curve) and sensor location $\pi$ (dash-dot curve) do not show zero
output. Channel model: Re = 10000, shear sensor at $\pi/2$, w(x) = sin(x), $L = 4\pi$, $\alpha = 1.0$.
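The zero-blocking behaviour of (5.6) to (5.9) can be sketched on the first-order model itself. The example below is a hypothetical illustration, not the Navier-Stokes test: it takes the zero at 0.317557 from the text, assumes a stable pole $p_1 = -1$, and integrates the realization $dx/dt = p_1 x + u$, $z = (p_1 - \zeta_1)x + u$ with a classical Runge-Kutta step from a zero initial state. In this realization the response to $u = e^{\zeta_1 t}$ is $z(t) = e^{p_1 t}$, which contains no component of the growing input, while an input that is not located at the zero produces a growing output.

```python
import numpy as np


def simulate(p1, zeta1, growth_rate, t_end=20.0, dt=1e-3):
    """Integrate x' = p1 x + u, z = (p1 - zeta1) x + u for u = exp(growth_rate t).

    This realizes H(s) = (s - zeta1) / (s - p1) with x(0) = 0.
    Returns the input and output values at the final time.
    """
    u = lambda t: np.exp(growth_rate * t)
    f = lambda t, x: p1 * x + u(t)
    x, t = 0.0, 0.0
    for _ in range(int(round(t_end / dt))):
        k1 = f(t, x)
        k2 = f(t + dt / 2, x + dt * k1 / 2)
        k3 = f(t + dt / 2, x + dt * k2 / 2)
        k4 = f(t + dt, x + dt * k3)
        x += dt * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        t += dt
    return u(t), (p1 - zeta1) * x + u(t)


p1 = -1.0            # hypothetical stable pole
zeta1 = 0.317557     # right-half-plane zero quoted in the text

u_end, z_at_zero = simulate(p1, zeta1, growth_rate=zeta1)   # input placed at the zero
_, z_off_zero = simulate(p1, zeta1, growth_rate=0.5)        # input away from the zero

print(f"input at t_end: {u_end:.1f}")
print(f"output, input at the zero:   {z_at_zero:.2e}")
print(f"output, input off the zero:  {z_off_zero:.2e}")
```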


6.  Feedback  control  and  stabilization 

At  this  point,  with  the  construction  of  a  valid  state-variable  model,  any  number  of 
control schemes may be employed to stabilize the system. A general control system is
shown in figure 4. The output of the system is fed into a controller. The output of the
controller is then used to create an input to the system. The design of a controller that
achieves certain system characteristics is the goal of control system design. Several
modern control techniques may be applied that require a state-variable model. In this
paper, the primary goal will be system stability. For this purpose, a simple constant
gain feedback with integral compensator will be shown to be sufficient.


6.1.  Constant  gain  feedback  with  integral  compensator 

Figure 5 shows an integral compensator feedback control scheme. K is referred to as
the gain of the feedback. The output of the system, in this case shear, is multiplied
by a feedback gain, integrated in time, and then fed back as blowing and suction at
the input. Note that the integrator is required because of the structure of the input
given in (3.27), i.e. the input is taken as the derivative of suction/blowing. The signal,
r(t), is called the reference signal. It is used to define the desired output. In our case,
we would like the shear output, $z(x_i, y=-1, t)$, to be zero. Therefore, we set r(t) to
zero. Then

$$u(t) = \frac{dq(t)}{dt} = -Kz(x_i, y=-1, t). \qquad (6.1)$$

Therefore, assuming q(0) = 0,

$$q(t) = -K\int_0^t z(x_i, y=-1, \tau)\,d\tau, \qquad (6.2)$$

and by observing (2.21), we may describe physical blowing and suction at the boundary:

$$v(x, y=-1, t) = -q(t)\,\frac{\partial w(x)}{\partial x}\,f(y=-1) = K\cos(x)\,f(y=-1)\int_0^t z(x_i, y=-1, \tau)\,d\tau. \qquad (6.3)$$

As defined earlier,

$$H(s) \triangleq \frac{\mathcal{L}[z(t)]}{\mathcal{L}[u(t)]} = \frac{Z(s)}{U(s)}. \qquad (6.4)$$

Therefore,

$$Z(s) = H(s)U(s). \qquad (6.5)$$

In the absence of feedback, one mode is unstable. Also, one pole exists at the origin
for the integrator. Consider a new transfer function from the reference input, r(t), to
the output, z(t), in the presence of feedback:

$$u(t) = r(t) - Kz(t). \qquad (6.6)$$

By taking the Laplace transform of (6.6),

$$U(s) = R(s) - KZ(s). \qquad (6.7)$$

Then

$$Z(s) = H(s)[R(s) - KZ(s)]. \qquad (6.8)$$

Finally,

$$\frac{Z(s)}{R(s)} = \frac{H(s)}{1 + KH(s)}. \qquad (6.9)$$

The new poles of the feedback system are defined by

$$1 + KH(s) = 0. \qquad (6.10)$$

As K gets larger and larger, it is clear that the poles of the new system tend toward
the zeros of H(s). In this way, modes of the system can sometimes be changed to
form a stable system (Franklin et al. 1988). Unstable modes that appear as poles
on the right-hand side of the complex s-plane and 'marginally stable' poles on the
Im(s)-axis are drawn to the left-hand side by applying feedback.

Figure 4. Feedback control. The output of the system is fed into a controller and then to the input.

Figure 5. Feedback control for channel system. The output of the system is multiplied by a
feedback gain, integrated in time, and then fed back at the input. The reference signal, r(t),
equals 0.
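The migration of closed-loop poles toward the open-loop zeros can be sketched with a small polynomial example. The transfer function below is hypothetical (an integrator pole at the origin, one unstable pair at $0.1 \pm i$, and two left-half-plane zeros at $-1 \pm i$) and is not the channel model; writing $H = N/D$, the closed-loop poles for negative feedback with gain K are taken as the roots of $D(s) + K N(s)$. The function name `closed_loop_poles` is introduced here.

```python
import numpy as np

# Hypothetical open-loop model H(s) = N(s) / D(s), illustrative values only.
N = np.array([1.0, 2.0, 2.0])          # zeros at -1 +/- i
D = np.array([1.0, -0.2, 1.01, 0.0])   # poles at 0 and 0.1 +/- i


def closed_loop_poles(K):
    """Roots of D(s) + K N(s), the closed-loop poles for feedback gain K."""
    return np.roots(np.polyadd(D, K * N))


# Trace a few points of the root locus.
for K in [0.0, 0.5, 2.0, 100.0, 1.0e4]:
    poles = closed_loop_poles(K)
    print(f"K = {K:8.1f}  max Re(s) = {np.max(poles.real):+.4f}")
```

For this example the loop is still unstable at K = 0.5, is stable at K = 2, and for very large K two of the poles sit close to the zeros at $-1 \pm i$ while the third moves far to the left.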

6.2.  Sensor  placement 

We have seen that, by applying feedback, poles of the system will eventually be drawn
to zeros of the system. In the channel flow system of figure 2, any feedback will
cause either the pole at the origin (integrator pole) or the unstable mode poles to be
drawn to the zero on the real axis in the right-hand plane, thus making the system
unstable to a greater degree. Therefore, finding a transfer function that has all zeros
in the left-hand plane becomes an important objective. The poles of the system are
independent of sensing or actuation. However, the zeros of the system are dependent
on both the type and location of sensing and the type and location of actuation.
Figure 6 shows the pole/zero configuration for the channel model with the shear
sensor at three different locations. Only the top half of the s-plane is shown since
the bottom half is a mirror image projected across the real axis as shown in figure 2.
The poles of all the models are in the same location as expected. However, the zeros
are different in all three cases. In figures 6(a) and 6(b), we observe a lone zero in
the right-hand s-plane. However, when the sensor is placed at $x = +\pi$, figure 6(c)
shows all zeros in the left-hand plane. In fact, there is a region around $x = \pi$ that
results in all zeros in the left-hand plane. By placing a shear sensor at $x = \pi$, simple
feedback with integral compensation will allow stabilization with the proper value
of gain, K. In the case of sensor locations that result in right-hand-plane, so-called
'non-minimum phase', zeros, stabilization is still possible. However, more complex
controllers (Bryson & Ho 1975; Ogata 1990) must be designed that are beyond the
scope of this paper.

6.3.  Root  locus  analysis  and  numerical  simulation 

One way to visualize how system poles will change as the feedback gain, K, changes
is to construct a root locus plot. This is a plot of all poles of a system as the feedback
gain varies from K = 0 to $K = \infty$. Figure 7 shows such a plot for the system shown in
figure 6(c). When K reaches 0.1, all feedback system poles (closed-loop poles) lie in
the left-hand plane and no instabilities exist in the new feedback system. Numerical
results obtained from the Navier-Stokes simulation for the new feedback controlled
system are shown in figure 8. The computation is carried out without feedback until
t = 50, after which the feedback is turned on. We see that the growing instability is
quickly suppressed. At the instant the controller is turned on, the simulation shows
a high transient response due to the non-continuous nature of the input at that time
instant. In terms of the state space defined in (3.28), (3.35),

$$\frac{dp}{dt} = Ap + Bu(t), \qquad (6.11)$$

$$z(x_i, y=-1, t) = Cp, \qquad (6.12)$$

$$u(t) = -Kz(x_i, y=-1, t). \qquad (6.13)$$

Then, in a closed loop,

$$\frac{dp}{dt} = Ap - B\left(Kz(x_i, y=-1, t)\right) \qquad (6.14)$$

$$= Ap - BKCp = (A - BKC)p. \qquad (6.15)$$

The solution to this equation is

$$p(t) = e^{(A - BKC)t}\,p(t=0). \qquad (6.16)$$

This solution will approach zero as $t \to \infty$ since the eigenvalues of $A - BKC$ (closed-loop
poles) are all stable.

Figure 6. Pole (x)/zero (o) configuration. Channel model: Re = 10000, w(x) = sin(x), $L = 4\pi$,
$\alpha = 1.0$. Only the top half of the s-plane is shown. Shear sensor at (a) $\pi/4$, (b) $3\pi/4$, (c) $\pi$.

Figure 7. Root locus plot for channel system with input as in the channel model (figure 1) and
shear output at $x = \pi$. The poles start at the open-loop poles shown with x's. As K increases, they
start to move toward the location of the system zeros, shown as o's. The pole at position s = (0,0)
moves directly to the left. The unstable pole moves quickly to a position just to the left of the
imaginary axis. Near 0.7 < y < 1.0, -0.6 < x < -0.4, we see a pole moving towards a zero that
is out of the range of this figure. Channel model: Re = 10000, shear sensor at $\pi$, w(x) = sin(x),
$L = 4\pi$, $\alpha = 1.0$. Only the top half of the s-plane is shown.

Figure 8. Navier-Stokes simulation of feedback control. Shear is multiplied by a feedback gain,
integrated, and fed back into the input. Channel model: Re = 10000, feedback shear sensor at $\pi$,
w(x) = sin(x), $L = 4\pi$, $\alpha = 1.0$. Curve 1 (solid) shows shear at location $\pi$; curve 2 (dot-dash) shows
shear at location $\pi/2$.
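The closed-loop relations (6.11) to (6.16) can be sketched for a small state-space model. The matrices below are hypothetical (a three-state companion form with an integrator pole at the origin and an unstable pair at $0.1 \pm i$) and the gain K = 2 is chosen for this toy model only, not for the channel. The helper `propagate` evaluates the matrix exponential through an eigendecomposition, which assumes the matrix is diagonalizable.

```python
import numpy as np

# Hypothetical single-input, single-output model in companion form.
A = np.array([[0.2, -1.01, 0.0],
              [1.0,  0.0,  0.0],
              [0.0,  1.0,  0.0]])
B = np.array([[1.0], [0.0], [0.0]])
C = np.array([[1.0, 2.0, 2.0]])
K = 2.0                                  # feedback gain for this toy model

A_cl = A - K * (B @ C)                   # closed-loop dynamic matrix, as in (6.15)


def propagate(A_matrix, p0, t):
    """Return exp(A t) p0 using an eigendecomposition (A assumed diagonalizable)."""
    eigvals, V = np.linalg.eig(A_matrix)
    coeffs = np.linalg.solve(V, p0.astype(complex))
    return (V @ (np.exp(eigvals * t) * coeffs)).real


p0 = np.array([1.0, 0.0, 0.0])           # arbitrary initial state
open_loop_growth = np.max(np.linalg.eigvals(A).real)       # positive: unstable
closed_loop_growth = np.max(np.linalg.eigvals(A_cl).real)  # negative: stable
p_late = propagate(A_cl, p0, 40.0)

print(open_loop_growth, closed_loop_growth, np.linalg.norm(p_late))
```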

6.4.  Robustness  in  the  presence  of  Reynolds  number  uncertainty 

A major advantage of feedback control systems is their robustness to system
uncertainty. From a practical point of view, Reynolds numbers may not be known
exactly or may change frequently, as in the flight of an airplane, for example.
Figure 9 shows the open-loop pole/zero configurations for systems with varying
Reynolds numbers. Note that these systems start with all zeros in the left-hand
s-plane. A root locus analysis shows that a feedback system with K = 0.1 stabilizes
both systems. Indeed, a feedback gain of K = 0.1 stabilizes systems for a wide
range of Reynolds numbers from Re = 1000 to Re = 40000. Figure 10 shows
the least-stable pole in both the controlled and uncontrolled systems for several
Reynolds numbers. Recall that a pole with real part less than zero is stable. We
see that near Re = 5772 an unstable pole appears in the open-loop (uncontrolled)
system. Unstable eigenvalues continue to exist in the open-loop system until
30000 < Re < 35000. An identical feedback controller, with gain K = 0.1,
however, stabilizes the system for Reynolds numbers in the range 1000 < Re <
40000. Clearly, the feedback controller is extremely robust to changes in Reynolds
number.

Figure 9. Pole (×)/zero (○) configuration. Channel model: shear sensor at π, w(x) = sin(x),
L = 4π, α = 1.0. Only the top half of the s-plane is shown. (a) Re = 7500, (b) Re = 12 500.

Figure 10. Real part of least-stable pole for open-loop (solid line) and closed-loop (dashed line)
channel system with 1000 < Re < 40000. Shear sensor at π, w(x) = sin(x), L = 4π, α = 1.0. Data
points are shown as ×'s.
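
The robustness check described in this subsection amounts to tracking the largest real part
among the eigenvalues of A − BKC while a plant parameter varies. The following Python sketch
shows that bookkeeping on a hypothetical two-state plant. The function `toy_plant`, its
parameter p, and the sweep range are assumptions made purely for illustration and are not the
channel-flow model; only the gain K = 0.1 is borrowed from the text.

```python
import numpy as np


def least_stable_real_part(A, B, C, K=0.0):
    """Largest real part among the eigenvalues of A - K*B*C (K = 0 is open loop)."""
    closed_loop = A - K * (B @ C)
    return float(np.max(np.linalg.eigvals(closed_loop).real))


def toy_plant(p):
    """Hypothetical stand-in plant: its open loop is unstable once p > 0.05."""
    A = np.array([[0.0, 1.0],
                  [-1.0, p - 0.05]])
    B = np.array([[0.0],
                  [1.0]])
    C = np.array([[0.0, 1.0]])  # the second state is the measured output
    return A, B, C


if __name__ == "__main__":
    # Sweep the uncertain parameter and compare open-loop and closed-loop stability.
    for p in np.linspace(0.0, 0.12, 7):
        A, B, C = toy_plant(p)
        open_loop = least_stable_real_part(A, B, C)
        closed = least_stable_real_part(A, B, C, K=0.1)
        print(f"p = {p:.2f}: open loop {open_loop:+.3f}, closed loop {closed:+.3f}")
```

In this toy sweep a single fixed gain keeps the closed-loop value negative over the whole
parameter range while the open-loop value changes sign, which is the same kind of comparison
that figure 10 makes for the channel system.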
7. Unobservable transient response as a path of bypass transition

Recently,  some  authors  (Trefethen  et  al.  1993;  Butler  &  Farrell  1992;  Farrell  1988; 
Henningson  1994)  have  suggested  a  possible  path  of  bypass  transition  in  Poiseuille 
flow  that  is  caused  by  high  transient  response  due  to  the  non-self-adjointness  of  the 
evolution  matrix.  This  high-transient  behaviour  is  due  to  the  non-orthogonality  of  the 
eigenmodes  and  not  caused  by  any  single  mode,  but  rather  a  combination  of  many 
modes.  In  this  section,  we  suggest  a  possible  different  form  of  bypass  transition  in  a 
controlled  system  due  to  a  single  mode.  In  the  presence  of  any  type  of  control,  it  is 
possible  that  energy  from  an  input  is  fed  into  a  mode  such  that  the  mode  is  reinforced 
during  the  transient  period  before  it  eventually  dies  out.  During  the  transient  period, 
however,  the  mode  may  grow  to  an  amplitude  large  enough  to  trigger  nonlinear 
effects  that  induce  transition  to  turbulence.  This  may  be  a  possible  path  of  bypass 
transition  in  a  controlled  system. 

Single modes may be characterized in linear systems in terms of modal controllability
and modal observability. Modal controllability implies that a particular mode
may be affected by the actuators chosen (blowing/suction in our case). Modal
observability implies that a particular mode may be measured by the sensors chosen
(shear in our case). A particular mode may be non-observable, non-controllable, or
both.  This  will  be  seen  in  §§7.1  and  7.2.  If  high-transient  modes  are  controllable  and 
observable,  control  theory  may  be  used  to  suppress  them.  However,  if  high-transient 
modes  are  unobservable,  the  high  transients  will  never  be  seen  and  feedback  control 
cannot  be  used  to  suppress  them. 


7.1.  Modal  canonical  form 

The state-space formulation provides an excellent framework for assessing the
reinforcement of each mode individually. Consider an n-dimensional state space with
scalar input and output

$$\frac{dx}{dt} = Ax + Bu, \tag{7.1}$$

$$z = Cx. \tag{7.2}$$

We  may  perform  a  similarity  transformation  on  the  system  to  produce  a  new  state- 
space  representation  with  the  same  input-output  relationship,  but  with  a  different 
interpretation  of  the  state  variables.  Let 


$$P = [\, v_1 \;\; v_2 \;\; \cdots \;\; v_n \,] \tag{7.3}$$

where $v_j$ is the jth eigenvector of the A matrix and n is the dimension of the A matrix.
A new representation is constructed as (Grace et al. 1992; Kailath 1980)

$$\frac{d\tilde{x}}{dt} = P^{-1}AP\tilde{x} + P^{-1}Bu, \tag{7.4}$$

$$z = CP\tilde{x} \tag{7.5}$$

where $\tilde{x} = P^{-1}x$. The new representation is written as

$$\frac{d\tilde{x}}{dt} = \tilde{A}\tilde{x} + \tilde{B}u, \tag{7.6}$$

$$z = \tilde{C}\tilde{x} \tag{7.7}$$

where the real eigenvalues of the original A matrix appear on the diagonal of $\tilde{A}$ and
the complex eigenvalues appear in a 2 × 2 block on the diagonal of $\tilde{A}$. For an A
matrix with eigenvalues $(\sigma_1, \sigma \pm j\omega, \sigma_2)$, the $\tilde{A}$ matrix is

$$\tilde{A} = \begin{bmatrix}
\sigma_1 & 0 & 0 & 0 \\
0 & \sigma & \omega & 0 \\
0 & -\omega & \sigma & 0 \\
0 & 0 & 0 & \sigma_2
\end{bmatrix}. \tag{7.8}$$


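In this form, each state-variable pair represents a mode of the system. Furthermore,
the modes are block decoupled so that each mode is represented by the 2 × 2 system

$$\frac{d}{dt}\begin{bmatrix}\tilde{x}_1 \\ \tilde{x}_2\end{bmatrix}
= \begin{bmatrix}\sigma & \omega \\ -\omega & \sigma\end{bmatrix}
\begin{bmatrix}\tilde{x}_1 \\ \tilde{x}_2\end{bmatrix}
+ \begin{bmatrix}b_1 \\ b_2\end{bmatrix}u, \qquad
z = \begin{bmatrix}c_1 & c_2\end{bmatrix}
\begin{bmatrix}\tilde{x}_1 \\ \tilde{x}_2\end{bmatrix},$$

where ω, σ, b₁, b₂, c₁, and c₂ are all scalars. This is known as the modal canonical
state-space form. All modes can be monitored directly for high-transient reinforcement
in this form.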
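
The transformation of §7.1 can be sketched numerically as follows. This is an illustrative
Python/NumPy sketch, not the authors' code: the function names `real_modal_form` and
`modal_gains`, the tolerance, and the four-state example system are all assumptions. To obtain
the real 2 × 2 blocks of (7.8), the sketch uses the real and imaginary parts of each complex
eigenvector as two columns of P, and it assumes A is real and diagonalizable.

```python
import numpy as np


def real_modal_form(A, B, C, tol=1e-9):
    """Return (A_m, B_m, C_m, P, poles) with A_m = inv(P) @ A @ P real and block diagonal."""
    eigvals, eigvecs = np.linalg.eig(A)
    columns, poles = [], []
    for lam, v in zip(eigvals, eigvecs.T):
        if lam.imag > tol:
            # Keep one member of each conjugate pair; Re(v) and Im(v) span its 2 x 2 block.
            columns += [v.real, v.imag]
            poles.append(complex(lam))
        elif abs(lam.imag) <= tol:
            # Real eigenvalue: a 1 x 1 block on the diagonal.
            columns.append(v.real)
            poles.append(float(lam.real))
    P = np.column_stack(columns)
    P_inv = np.linalg.inv(P)
    return P_inv @ A @ P, P_inv @ B, C @ P, P, poles


def modal_gains(B_m, C_m, poles):
    """List (pole, b, c) per mode, where b and c are the input and output gains of its block."""
    gains, row = [], 0
    for pole in poles:
        width = 2 if isinstance(pole, complex) else 1
        gains.append((pole, B_m[row:row + width, 0], C_m[0, row:row + width]))
        row += width
    return gains


def example_system():
    """Hypothetical 4-state system whose oscillatory mode is uncontrollable by construction."""
    A_modal = np.array([[-1.0, 0.0, 0.0, 0.0],
                        [0.0, -0.3, 0.6, 0.0],
                        [0.0, -0.6, -0.3, 0.0],
                        [0.0, 0.0, 0.0, -2.0]])
    T = np.array([[1.0, 0.5, 0.0, 0.2],
                  [0.0, 1.0, 0.3, 0.0],
                  [0.0, 0.0, 1.0, 0.4],
                  [0.0, 0.0, 0.0, 1.0]])
    T_inv = np.linalg.inv(T)
    A = T @ A_modal @ T_inv
    B = T @ np.array([[1.0], [0.0], [0.0], [1.0]])  # no input enters the oscillatory block
    C = np.ones((1, 4)) @ T_inv
    return A, B, C


if __name__ == "__main__":
    A, B, C = example_system()
    A_m, B_m, C_m, P, poles = real_modal_form(A, B, C)
    for pole, b, c in modal_gains(B_m, C_m, poles):
        print("pole", pole, "| b =", np.round(b, 6), "| c =", np.round(c, 6))
```

For the example system, the input gains of the oscillatory block come out as zero to rounding
error while its output gains do not, which is the numerical signature of an uncontrollable but
observable mode.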


7.2.  Modal  observability  and  controllability 

It is easy to see that if b₁ and b₂ are zero, no input can affect the mode and the
mode is uncontrollable. Similarly, if c₁ and c₂ are zero, then the motion of x̃₁ and
x̃₂ cannot be measured at the output, z, and the mode is unobservable. In terms
of stabilization, an uncontrollable mode is not problematic as long as it is stable.
However, if an unstable mode is uncontrollable, nothing can be done to stabilize it.
Observability may be a problem even if modes are stable. If an unobservable mode
is highly amplified by control energy, no attempt could be made to suppress it since
it would not be observed at the output. In terms of poles and zeros, uncontrollable
or unobservable modes both show up as pole/zero cancellations in the complex
s-plane. As can be seen from figure 2, many poles and zeros for the channel system
lie on top of each other, indicating that certain modes in the system are either
uncontrollable, unobservable, or both.

Although a mode may not be physically observable at the output, the modal canonical
formulation allows us to numerically observe the state evolution of each mode directly.
In this way, we may assess the risk of highly amplified modes triggering bypass
transition. This is done for the controlled system simulated in figure 8. The
uncontrolled linear system is simulated with an initial condition for 200 time steps,
at which time a feedback controller with gain K = 0.1 is turned on. Figure 11 shows
the state evolution of the modes with poles at s = −0.1474 ± 0.8514i and
s = −0.3252 ± 0.6361i as well as the state evolution of the one unstable mode.† In
addition, figure 11(a) shows the shear measurement at the output. We see that the
unstable mode dies out quickly as soon as the feedback is turned on. In addition, the
mode at s = −0.1474 ± 0.8514i gains almost no energy after the feedback is activated.
This indicates that the mode is not a bypass mode. The mode at s = −0.3252 ± 0.6361i
is seen to gain a lot of energy after the controller is started. Indeed, the amplitude
of the transient response is nearly twice that of the unstable mode. This represents a
possible bypass mode since such an amplitude may force the system into the nonlinear
region where transition to turbulence may be triggered. Furthermore, the high energy
nature of the mode cannot be observed at the shear sensor since this mode is
unobservable. This can be seen from the plot of the shear sensor output. Modal
unobservability is also suggested by noting the relatively low c₁ and c₂ values of
0.5155, −0.1131 compared to the c₁ and c₂ values of −162.85, 234.66 for the unstable
mode, which is clearly seen at the output. Fortunately, the high energy nature of the
mode does not lead to a bypass transition in this case. This is verified by the
Navier-Stokes simulation in figure 8, which shows no nonlinear effects. If the
instability were allowed to grow to a higher amplitude before the controller was
applied, however, the highly amplified mode in the transient response might have
triggered nonlinear effects.

† Note that amplitudes shown as a result of linear system simulations should only be used for
comparison with each other since the linear model has been scaled to unity open-loop feedforward
gain, i.e. H(s) = kZ(s)/U(s) where k = 1.

Figure 11. Linear model state evolution of system modes with control applied at t = 200: (a)
measured shear, (b) the state evolution of the unstable mode, (c) the state evolution of the mode
at s = −0.1474 ± 0.8514i, (d) the state evolution of the mode at s = −0.3252 ± 0.6361i. Channel
model: Re = 10000, shear sensor at π, w(x) = sin(x), L = 4π, α = 1.0.
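
The link between modal gains and pole/zero cancellation can be made explicit in the simplest
case. As a sketch, assume that all eigenvalues $\lambda_i$ of A are real and distinct, so that
the modal state matrix is diagonal and each mode has one scalar input gain $b_i$ and one scalar
output gain $c_i$ (these indexed symbols are notation introduced here for illustration only).
The transfer function from input u to output z is then

$$H(s) = \frac{Z(s)}{U(s)} = \sum_{i=1}^{n} \frac{c_i\, b_i}{s - \lambda_i}.$$

If $b_i = 0$ (uncontrollable) or $c_i = 0$ (unobservable), the ith term vanishes and the pole at
$s = \lambda_i$ no longer appears in H(s); in a pole/zero plot it is covered by a zero at the
same location. When only $c_i = 0$, the modal state is still driven by the input through
$b_i$, so its growth has to be monitored through the modal states rather than the output.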


8.  Multiple  instability  control 

Although most work has focused on suppression of single instabilities, more realistic
models should include multiple wavenumbers. Indeed, for a given Reynolds number,
an infinite number of wavenumbers exist. Each wavenumber contributes its own poles
and zeros to the control-theoretic model. Consider a model with non-dimensional
channel length 20π, where input is applied as w(x) = sin(x) + sin(0.9x). In this model,
wavenumbers of 0.9 and 1.0 are included. Both wavenumbers lead to unstable modes.
The pole/zero configuration of this new two-wavenumber model with shear sensor
at π is shown in figure 12(a). Note that in comparison to the one-wavenumber
model, more poles and zeros exist. The original one-wavenumber model contained
one 'fork' structure of poles, while the two-wavenumber model contains two 'fork'
structures. It can be visualized that in a model with several wavenumbers, several
'forks' will stack on top of each other in the s-plane. Near s = i0.2 are two
unstable poles in the right-hand plane that represent the two unstable modes in the
system. As seen before, all zeros lie in the left-hand s-plane. Figure 12(b) shows
the closed-loop poles after feedback with gain K = 0.1. Results from the
Navier-Stokes simulation are shown in figure 13. The computation is carried out without
feedback until t = 200. A combination of two growing waves is seen at the shear
sensor. At t = 200, feedback with integral compensation is applied and the system is
stabilized.

Figure 12. (a) Pole (×) and zero (○) configuration of channel system of length 20π, including
wavenumbers of 0.90 and 1.00. Re = 10000, shear sensor at π, w(x) = sin(x) + sin(0.9x). Only the
top half of the s-plane is shown. (b) Same as (a) but showing closed-loop poles after feedback with
gain K = 0.1.

Figure 13. Navier-Stokes simulation of channel system of length 20π, including wavenumbers of
0.90 and 1.00. Re = 10000, feedback shear sensor at π, w(x) = sin(x) + sin(0.9x), feedback gain
K = 0.1. Curve 1 (solid) shows shear at location π; curve 2 (dash-dot) shows shear at location 2π.
9. Effect of linear controllers on two-dimensional finite-amplitude
disturbances

The critical Reynolds number above which linear instabilities exist in plane
Poiseuille flow is Re ≈ 5772. Below this Reynolds number, Poiseuille flow is linearly
stable. However, transition from laminar to turbulent flow occurs in experiments
at much lower Reynolds numbers, typically around Re = 1000. Moreover, even
for super-critical Reynolds number flows, linear instabilities predict extremely slow
growth rates. In practice, transition occurs orders of magnitude more quickly.
Therefore, linear stability alone does not dictate transition. Transition to turbulence is
generally accepted to be a nonlinear three-dimensional phenomenon. In previous
sections, we constructed two-dimensional linear controllers based on the two-dimensional
linearized Navier-Stokes equations. By applying these controllers to plane Poiseuille
flow with infinitesimal two-dimensional disturbances, we saw that we may linearly
stabilize the system so that the flow could no longer support those disturbances. If
we apply our linear controllers to flows that contain finite-amplitude disturbances, we
can no longer use linear analysis to describe the flow dynamics as nonlinear terms
become relevant. However, the application of a controller based on linear analysis
does change the nonlinear system.

Many authors (Orszag & Patera 1983; Bayly, Orszag & Herbert 1988) have explored
the effect of two-dimensional finite-amplitude disturbances on three-dimensional
infinitesimal disturbances in plane Poiseuille flow. Above Re ≈ 2900, for two dimensions,
stable non-attenuating finite-amplitude equilibria exist in plane Poiseuille flow. Below
Re ≈ 2900, non-decaying, finite-amplitude equilibria do not exist. However, in flows
with 1000 ≤ Re < 2900, the timescale for decay is so large that the flow may be
considered in 'quasi-equilibrium'. It has been shown that in the presence of such
two-dimensional finite-amplitude disturbances, infinitesimal three-dimensional
disturbances are highly unstable and may cause transition in shear flows. Figure 14 shows
the energy of a single-wavenumber, two-dimensional finite-amplitude disturbance
(α = 1.0) and a single-wavenumber-pair, three-dimensional infinitesimal disturbance
(α = 1.0, β = ±1.0) obtained through direct numerical simulation at Re = 3000. The
maximum amplitude of the two-dimensional finite-amplitude disturbance is 0.1U_c.
At this Reynolds number, wavenumber, and initial energy level, we see the
two-dimensional finite-amplitude disturbance decaying slowly. The three-dimensional
disturbance, on the other hand, is seen to rapidly gain energy. Orszag & Patera (1983)
show that the two-dimensional instability acts as a mediator for transfer of energy
from the mean flow to the three-dimensional disturbance, but does not directly provide
energy. The growth rate of the three-dimensional disturbance is orders of magnitude
larger than that of the Orr-Sommerfeld instabilities.

Orszag & Patera show, in the case where Re < 1000, that the attenuation of
finite-amplitude two-dimensional disturbances is large enough that three-dimensional
disturbances do not become unstable. The fact that high attenuation of the
two-dimensional finite-amplitude disturbance prevented three-dimensional instability
below Re ≈ 1000 suggests that if controllers can be created that speed up the attenuation
of the finite-amplitude two-dimensional disturbance for flows with Reynolds numbers
greater than 1000, three-dimensional instability may be eliminated. We have
seen in §6 that linear controllers did stabilize infinitesimal two-dimensional
disturbances. Figure 15 shows the effects of the linear controller of §6 when applied
to a system with the same finite-amplitude two-dimensional disturbance and
infinitesimal three-dimensional disturbance shown in figure 14. The linear controller
dramatically increased the attenuation of the finite-amplitude two-dimensional
disturbance. As a result, the infinitesimal three-dimensional disturbance was rendered
stable. Clearly, the linear controller not only stabilized the two-dimensional linear
system, but also had a stabilizing effect on the three-dimensional nonlinear
system. These results show the promise of linear controllers even in nonlinear systems.

Figure 14. Energy of a two-dimensional finite-amplitude disturbance (α = 1.0) and a
three-dimensional infinitesimal disturbance (α = 1.0, β = ±1.0) for Re = 3000. Solid line
represents the total energy of the two-dimensional finite-amplitude disturbance and the dotted line
represents that of the three-dimensional infinitesimal disturbance.

Figure 15. Effect of linear controller when applied at t = 15 for Re = 3000. The top two lines
show the two-dimensional finite-amplitude disturbance without (solid) and with (dash-dot) linear
control. We see at t = 15 that the controlled two-dimensional finite-amplitude disturbance is highly
damped. The bottom two lines show the infinitesimal three-dimensional disturbance without (dot)
and with (dashed) linear control. The uncontrolled disturbance gains energy quickly, while the
controlled disturbance quickly stops gaining energy.


10.  Conclusions 

In this paper, we have developed feedback controllers that linearly stabilize
plane Poiseuille flow. We used a Galerkin spectral method to generate state-space
models. Indeed, any (convergent) numerical method that reduces the governing
partial differential equations into a set of ordinary differential equations may be used.
It is important to note, however, that the meaning of the state variables in the
state-space model changes as different numerical methods are employed. Even though
state variables may not have any specific physical meaning, some numerical methods
may result in favourable divisions of system dynamics. In the plane Poiseuille
flow case, the spectral Galerkin method with the use of Fourier components in the
x-direction and combination-Chebyshev polynomials in the y-direction led to
modelled dynamics that were de-coupled by wavenumber. This led to a block diagonal
form of the A matrix. As a result, we were clearly able to describe modelled and
unmodelled dynamics in terms of wavenumber dynamics included and not included
in our finite-dimensional model. This also led us to the concept of using distributed
control to render certain wavenumbers 'uncontrollable'. If other numerical methods
had been used, this concept would not have been so transparent. Furthermore,
plane Poiseuille flow can be linearly stabilized with simple feedback controllers if
sensors are placed at judicious locations. Both the position and the type of sensing
and actuation change the zeros of single-input/single-output models. In our case,
we have seen that certain shear sensor locations lead to 'minimum-phase' systems
and some locations lead to 'non-minimum phase' systems. Minimum-phase systems
are in general easier to control than non-minimum phase systems. As a result, we
were able to stabilize plane Poiseuille flow with a constant gain feedback, integral
compensator controller. In addition, the controller was extremely robust to a wide
range of Reynolds numbers. Also, we have shown that one danger of feedback
control is that the linear transient response of a controlled system can lead to high
amplitudes for a short period of time. If these amplitudes are high enough, it is
possible that they may invalidate the linear model and enhance nonlinear effects.
Furthermore, the high transients may not be observable at the output so that
feedback control cannot be used to suppress them. Finally, we have shown that linear
controllers have a strong stabilizing effect on two-dimensional finite-amplitude
disturbances. As a result, three-dimensional secondary instabilities can be rendered
stable.
A new adaptive controller based on a neural network was constructed and applied to turbulent
channel  flow  for  drag  reduction.  A  simple  control  network,  which  employs  blowing  and  suction  at 
the  wall  based  only  on  the  wall-shear  stresses  in  the  spanwise  direction,  was  shown  to  reduce  the 
skin  friction  by  as  much  as  20%  in  direct  numerical  simulations  of  a  low-Reynolds  number  turbulent 
channel  flow.  Also,  a  stable  pattern  was  observed  in  the  distribution  of  weights  associated  with  the 
neural  network.  This  allowed  us  to  derive  a  simple  control  scheme  that  produced  the  same  amount 
of  drag  reduction.  This  simple  control  scheme  generates  optimum  wall  blowing  and  suction 
proportional  to  a  local  sum  of  the  wall-shear  stress  in  the  spanwise  direction.  The  distribution  of 
corresponding  weights  is  simple  and  localized,  thus  making  real  implementation  relatively  easy. 
Turbulence characteristics and relevant practical issues are also discussed. © 1997 American
Institute of Physics. [S1070-6631(97)02706-2]


I. INTRODUCTION

The  ability  to  control  turbulent  flows  is  of  significant 
economic  interest.  Successful  control  of  turbulent  boundary 
layers by reducing drag, for example, can result in a substantial
reduction in operational cost for commercial aircraft and
marine  vehicles.  Recent  studies^’^  show  that  near-wall 
streamwise  vortices  are  responsible  for  high  skin-friction 
drag  in  turbulent  boundary  layers.  Some  attempts  have  been 
made to reduce the skin-friction drag by controlling the
interactions between these vortices and the wall. Choi et al.,^
for example, used blowing and suction at the wall equal and
opposite to the wall-normal component of velocity at
y⁺ ≈ 10. They showed that this control effectively mitigated
the streamwise vortices, giving approximately 25% drag
reduction in a turbulent channel flow. Although the method
employed in their work is impractical, since the information
at y⁺ = 10 is usually not available, it demonstrates a control
scheme which reduces skin-friction drag by manipulation of
the  near-wall  streamwise  vortices.  Another  way  to  control 
the  streamwise  vortices  is  to  impose  spanwise  oscillation  by 
a  moving  wall  or  externally  imposed  body  force.^’"^  These 
methods,  however,  require  a  large  amount  of  energy  input. 

A  systematic  approach  using  suboptimal  control  theory 
has  also  been  tried  in  the  past.  This  approach,  which  attempts 
to  minimize  a  cost  function,  was  successfully  applied  to  the 
stochastic  Burgers  equation.^  Moin  and  Bewley^  applied  a 
similar  approach  to  a  turbulent  channel  flow  to  achieve  up  to 
50% drag reduction. The control, however, requires information
from the entire velocity field inside the flow domain and
excessive  computational  time,  making  it  impractical  to 
implement in real situations. For practical implementation, a
control scheme should utilize only quantities that are easily
measurable at the wall, and should be fast enough to be
applied in real time.

Corresponding author: Telephone: (310)825-4393; Fax: (310)206-4830;
Electronic mail: jkim@turb.seas.ucla.edu

The objective of the present work is to seek wall actuations,
in the form of blowing and suction at the wall, dependent
on the wall-shear stress to achieve a substantial skin-friction
reduction. This requires knowledge of how the wall-shear
stresses respond to wall actuations, i.e., the correlation
between the wall-shear stresses and the wall actuations.
Because of the complexity of the solutions to the Navier-Stokes
equations, however, it is not possible to find such a
correlation in closed form or to approximate it in simple
form. Instead, we use a neural network to approximate the
correlation which then predicts the optimal wall actuations to
achieve the minimum value of the skin-friction drag. Neural
networks have been used to obtain complicated, nonlinear
correlations without a priori knowledge of the system that is
to be controlled. Jacobson and Reynolds,^ for instance, used
a neural network to obtain about 7% drag reduction in their
simulation of an artificial flow. In this paper, we describe
how we constructed and trained a neural network off-line,
and then implemented an on-line control scheme (blowing
and suction at the wall) for drag reduction based on that
neural network. Applying this control scheme to direct
numerical simulations of turbulent channel flow at low Reynolds
number, we observed about 20% drag reduction. We
then describe how examination of the weight distribution
from the on-line neural network led to a very simple control
scheme that worked equally well while being computationally
more efficient.

In Sec. II, a brief description of the architecture of the
neural network used in the present work is given. In Sec. III,
results obtained from control using an off-line trained network
are presented, while results obtained from control using
a network with continuous on-line training are given in Sec.
IV. In Sec. V, a simple control law based on the weight
distribution from the successful neural network control is
presented. A few turbulence statistics are given in Sec. VI,
followed in Sec. VII by a discussion of practical implementation
and the conclusions.

[Figure 1 input label: Surface Shear Stresses (∂w/∂y)]

FIG. 1. Neural network architecture.

In this paper we use (x, y, z) for the streamwise, wall-normal,
and spanwise coordinates, respectively, and
(u, v, w) for the corresponding velocity components.


II. NEURAL NETWORK

In this section we describe the construction of a neural
network to learn the correlation between wall-shear stresses
and wall actuations from a given data set. Although a neural
network generally requires no prior knowledge of the system
(or "plant"), knowledge about the near-wall turbulence
structures provides a guideline for the design of the network
architecture. Initially ∂u/∂y and ∂w/∂y at the wall at several
instances of time were used as input data fields and the
actuation at the wall was used for the output data of the
network. Experimentally we found that only ∂w/∂y at the wall
from the current instance of time was necessary for sufficient
network performance. Because we wanted the output to be
based only on a local input area, we designed our network
using shared weights. The network had a single set of
weights (a template) that is convolved over the entire input
space to generate output values; that is, we used the same set
of weights for each data point and the training involves
iterating over all data points. The template extracts spatially
invariant correlations between input and output data. The
size of the template was initially chosen to include information
about a single streak and streamwise vortex, and then
was varied to find an optimal size.

We used a standard two-layer feedforward network with
hyperbolic tangent hidden units and a linear output unit (see
Fig. 1). The functional form of our final neural network is

$$v_{jk} = W_a \tanh\left( \sum_{i=-(N-1)/2}^{(N-1)/2} W_i \left.\frac{\partial w}{\partial y}\right|_{j,k+i} - W_b \right) - W_c, \tag{1}$$

where the W's denote weights, N is the total number of input
weights, and the subscripts j and k denote the numerical grid
point at the wall in the streamwise and spanwise directions,
respectively. $N_x$ and $N_z$ are the numbers of computational
domain grid points in each direction. The summation is done
over the spanwise direction. Seven neighboring points
(N = 7), including the point of interest, in the spanwise
direction (corresponding to approximately 90 wall units with
our numerical resolution) were found to provide enough
information to adequately train and control the near-wall
structures responsible for the high skin friction. Note that the
blowing and suction are applied at each grid location according
to (1) as a numerical approximation of distributed blowing
and suction on the surface. A scaled conjugate gradient
learning algorithm^ was used to produce rapid training. For
given pairs of $(v_{jk}^{des}, \partial w/\partial y|_{jk})$, the network was trained to
minimize the sum of a weighted-squared error given by

$$\mathrm{error} = \frac{1}{2} \sum_{j} \sum_{k} e^{\lambda |v_{jk}^{des}|} \left( v_{jk}^{des} - v_{jk}^{net} \right)^2, \tag{2}$$

where $v_{jk}^{des}$ is the desired output value and $v_{jk}^{net}$ is the network
output value given by Eq. (1). The weights were initialized
with a set of random numbers. Note that the error defined in
(2) exponentially emphasizes (proportional to λ) large actuations.
This error scaling was chosen based on Choi et al.'s^
observation that large actuations are more important for drag
reduction. Usually within 100 training epochs, the error
reached its asymptotic limit.
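
The shared-weight template and the weighted error of this section can be sketched in a few
lines of Python. This is a minimal illustration under stated assumptions rather than the
authors' implementation: it assumes Eq. (1) has the form v_jk = W_a tanh(Σ_i W_i ∂w/∂y|_{j,k+i} − W_b) − W_c,
assumes Eq. (2) weights each squared error by exp(λ|v_des|), treats the spanwise direction as
periodic, and uses random arrays as stand-ins for simulation data. The function and argument
names are hypothetical.

```python
import numpy as np


def template_network(dwdy, w_in, w_a, w_b, w_c):
    """Evaluate the assumed form of Eq. (1) at every wall grid point.

    dwdy : array of shape (Nx, Nz), spanwise wall shear at grid point (j, k)
    w_in : N template weights W_i for i = -(N-1)/2 ... (N-1)/2, with N odd
    """
    half = (len(w_in) - 1) // 2
    pre_activation = np.zeros_like(dwdy, dtype=float)
    for idx, i in enumerate(range(-half, half + 1)):
        # Rolling by -i along axis 1 places dwdy[j, (k + i) % Nz] at position [j, k].
        pre_activation += w_in[idx] * np.roll(dwdy, -i, axis=1)
    return w_a * np.tanh(pre_activation - w_b) - w_c


def weighted_error(v_des, v_net, lam):
    """Assumed form of Eq. (2): squared error weighted by exp(lam * |v_des|)."""
    return 0.5 * np.sum(np.exp(lam * np.abs(v_des)) * (v_des - v_net) ** 2)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    dwdy = rng.standard_normal((32, 32))   # stand-in for wall-shear data
    v_des = rng.standard_normal((32, 32))  # stand-in for desired actuation
    w_in = rng.standard_normal(7)          # N = 7, as in the text
    v_net = template_network(dwdy, w_in, w_a=1.0, w_b=0.0, w_c=0.0)
    print(v_net.shape, weighted_error(v_des, v_net, lam=1.0))
```

Because the same N weights are reused at every grid point, the number of trainable parameters
is N + 3 regardless of the grid size, which is what makes the learned template easy to inspect.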

III.  OFF-LINE  TRAINING  AND  CONTROL 

As an initial experiment we investigated whether we
could train a neural network to predict the velocity at
y⁺ ≈ 10 from only the wall-shear stresses. The rationale
behind this experiment was to duplicate an existing control law
using only measurements available at the wall such that the
output from the network could be used as input to the
actuator. The network should yield a similar amount of drag
reduction to that obtained by Choi et al.^ through prediction of
the velocity at y⁺ = 10. The training data consisted of 100
time steps of output obtained from a numerical simulation of
channel flow under the control employed by Choi et al.,^ i.e.,
using the wall-normal velocity at y⁺ = 10. The flow regime is
turbulent channel flow with Re_τ = 100, where Re_τ is the
Reynolds number based on the wall-shear velocity, u_τ, and the
channel half-width, δ. All numerical simulations presented in
this paper were obtained using a modified version of Kim
et al.'s^ spectral code with the computational domain
(4π, 2, 4π/3)δ, and a grid resolution of (32, 65, 32) in the
(x, y, z) directions, respectively. Each time step contained a
32 × 32 array of input values (∂w/∂y|_w) and corresponding
actuations, −v at y⁺ = 10.

We trained several networks with one hidden unit and
different sized input templates: 7 × 1, 7 × 3, 7 × 5, 9 × 1,
9 × 3, 11 × 1, and 11 × 3 (the number of input points in the
spanwise direction by the number of input points in the
streamwise direction). After training was completed, the
distribution of input weights was examined to see whether there
was a discernible pattern in the input template. The weight
distributions in the spanwise direction at the same streamwise
location for different input templates are shown in Fig.
2. The same pattern for all 7 input template sizes was
observed, and is similar to a finite difference approximation of
spanwise differentiation. Increasing the number of hidden
units only marginally improved performance, whereas
increasing the template size significantly reduced the final
training error.

FIG. 2. Weight distribution from off-line trainings for various sizes of the
input template: 7 × 1, 7 × 3, 7 × 5, 9 × 1, 9 × 3, 11 × 1, 11 × 3. Weights are
normalized by W_1.

We then applied a control scheme to a regular channel flow by fixing the final input
weights obtained from the off-line training. This was perhaps a somewhat naive approach,
since the weights were obtained from fully controlled flow data and the shear stresses
used for the training were already altered by the actuations. Nevertheless, two cases
were tested: a control scheme based on 7 weights in the spanwise direction, and another
based on the same 7 weights plus 3 more in the immediate downstream location. These 10
weights were chosen because, among all weights obtained from the off-line training, they
had non-negligible values. In Fig. 3, the mean shear stress variations at the wall
(i.e., drag) obtained with these two controls are plotted along with the no-control
case. Nearly 18% drag reduction was achieved, with slightly better performance from the
10-weight network.

These results demonstrate that a correlation exists between the shear stresses at the
wall and the desired actuations, and that control based on this correlation produces a
significant amount of drag reduction. This fixed-weight control scheme, however, was
deduced from fully controlled flow data; thus it does not guarantee the same performance
for other flow situations.

FIG. 3. Mean wall-shear stress histories for various control laws compared to the
no-control case: —, no control; —, control with 7 fixed weights obtained from off-line
control; ···, control with 10 fixed weights from the same off-line control. All runs
with control shown in this paper were integrated over at least 20 time units, which is
much longer than the typical turnover time of the streamwise vortices, 1 time unit.

FIG. 4. Schematic representation of adaptive inverse model control.


IV.  ON-LINE  CONTROL 


In the previous section we showed how successful control based on off-line training can
be obtained. However, since the system we are trying to control is time-varying and
nonlinear, this approach is not likely to generalize well. Continuous on-line training
allows a controller to adapt to the evolution of the system. In this section, we
describe an adaptive controller for on-line training and control.

There are various schemes for on-line neural network control. The most direct scheme is
adaptive inverse model control. A schematic representation of this approach is shown in
Fig. 4. Here the plant denotes the numerical solver of the Navier-Stokes equations. This
configuration employs a neural network to model the (possibly time-varying) inverse
plant mapping from the wall-shear stress $\partial w/\partial y|_w$ to the wall-normal
actuations, and then uses a copy of the model as the controller with the desired shear
stresses as input. One restriction of this technique is that it usually requires an
initial model training phase using random plant inputs and corresponding plant outputs.
This, however, caused no serious problems, since one time step was usually enough for
model training because of the shared-weight architecture in our application (each time
step provides 1024 data points and our networks have only a few weights). Once the model
represents a reasonably close approximation to the actual plant inverse, a copy is then
implemented as a feedforward controller.

The desired inputs to the controller are a fractional reduction in the shear stress from
the previous step, i.e.,

$$\left(\left.\frac{\partial w}{\partial y}\right|_w\right)^{n+1}_{\text{desired}} = \eta \left(\left.\frac{\partial w}{\partial y}\right|_w\right)^{n}, \tag{3}$$

where $0 < \eta < 1$. Based on the networks and techniques we investigated, this indirect
suppression of $\partial w/\partial y|_w$, instead of $\partial u/\partial y|_w$, turns
out to be more efficient in achieving drag reduction. The output of the controller,
which is the input to the plant, is the predicted actuation necessary to produce this
shear-stress reduction. The quantity $\eta$ should be chosen such that sets of the
desired large amplitude outputs are among the sets that are well represented by the
training sets. Good performance was achieved for the range $\eta = 0.8$-$0.85$.

FIG. 5. Distribution of the weights: ○, from on-line training; —,
$A(1-\cos(\pi j))/j$. $\Delta z^+ = 13$.

A turbulent channel flow at Reynolds number $Re_\tau = 100$ was used to test the neural
network. We allowed all the weights in the network to adapt and examined the input
template pattern after each time step. As the control began, the weight distribution
immediately assumed a fixed pattern similar to the off-line pattern. There was no
appreciable change in the relative magnitudes of the template weights over time,
indicating that the pattern is preserved. The absolute magnitudes, however, did vary,
indicating the need for gain and bias adaptation for each layer. The number of hidden
layers, the number of hidden units, the size of the input template, and the error scale
of the training error [X of Eq. (2)] were 1, 1, 7×1, and 5, respectively. We varied the
value of the error scale and found 5 to be optimum, with larger values causing an
instability in training. We also varied the number of hidden units, but the network with
a single hidden unit produced the best result. The converged weight distributions for 20
consecutive time steps after an asymptotic state was reached are shown in Fig. 5. The
pattern is only slightly different from the one obtained from the off-line training (see
Fig. 2). It should be noted that this pattern emerged immediately after the on-line
control began.

Time histories of the wall-shear stress for 3 different input template sizes are shown
in Fig. 6. All other network parameters were kept the same. The abscissa represents
nondimensional time normalized by $u_\tau$ and $\delta$. One time unit roughly
corresponds to the time for a fluid particle traveling with the centerline velocity to
move about 20 channel half-widths. The computations were carried out long enough to
ensure that a statistically steady state was reached. As the control began, the drag
quickly dropped to about 80% of that observed without control for the two cases with
template sizes of 7×1 and 9×1. The template size of 5×1, however, did not produce as
much reduction, indicating that at least 7 spanwise points (which extended about 90 wall
units with our grid resolution) should be used for good performance. At the initial
stage of our study, we tested a control using both $\partial w/\partial y|_w$ and
$\partial u/\partial y|_w$ as input, with a 7×5 input template. It produced about the
same amount of reduction, but with excessive training time due to a significantly larger
number of weights. Furthermore, it did not produce coherent patterns in the weight
distributions.

FIG. 6. Mean wall-shear stress histories for on-line control with different input
template sizes: —, no control; control with 5×1; —, control with 7×1; —, control
with 9×1.

Since the 7 weights showed a fixed pattern, we fixed the input template weights to this
pattern and used a single hidden unit network, giving only 4 adaptable parameters (a
bias and gain for each layer). This simplified network had the following functional
form:

$$v_{jk} = W_a \tanh(W_b\, g_{jk} - W_c) - W_d, \tag{4}$$

with

$$g_{jk} = \sum_{i=-3}^{3} W_i \left.\frac{\partial w}{\partial y}\right|_{j,k+i}, \tag{5}$$

where the $W_i$'s are the fixed-weight pattern obtained from the previous on-line
control. On-line control using this network produced a similar amount of drag reduction.
The weight variations with time were also monitored. The bias weights ($W_c$ and $W_d$)
were negligibly small. Those controlling the gain ($W_a$ and $W_b$), which had finite
values, changed significantly in time, although the product of the two gain weights
remained almost constant. This suggests that effective control can be achieved by simply
using $g$ with an adjustable amplitude. This will be discussed in the following section.
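As an illustration only, the simplified network of Eqs. (4) and (5) can be sketched in a few lines of Python. The function name, the array layout (streamwise index first, spanwise index second, periodic in the span), and the parameter names `w_a`, `w_b` (gains) and `w_c`, `w_d` (biases) are assumptions made for this sketch; they are not taken from the original code.

```python
import numpy as np


def simplified_network(dwdy, template, w_a, w_b, w_c, w_d):
    """Evaluate v = w_a * tanh(w_b * g - w_c) - w_d on a wall grid.

    dwdy     : 2-D array of wall-shear stress dw/dy|_w, shape (n_x, n_z)
    template : odd-length sequence of fixed spanwise weights W_i,
               ordered from i = -half to i = +half
    """
    dwdy = np.asarray(dwdy, dtype=float)
    half = len(template) // 2
    g = np.zeros_like(dwdy)
    for i, w in zip(range(-half, half + 1), template):
        # np.roll(..., -i) moves the value at spanwise index k+i to index k
        # (periodic wrap-around is assumed in the spanwise direction).
        g += w * np.roll(dwdy, -i, axis=1)
    return w_a * np.tanh(w_b * g - w_c) - w_d
```

With negligible biases the output reduces to `w_a * tanh(w_b * g)`, which for small arguments is approximately `w_a * w_b * g`; this is one way to read the observation that only the product of the two gains matters.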


V.  A  SIMPLE  CONTROL  SCHEME 


Since the control based on the network given by Eq. (4) produced a substantial reduction
in drag, this section develops a control scheme based only on the weighted sum of the
wall-shear stress. The distribution of the weights can be approximated by (see Fig. 5):

$$W_j = A\,\frac{1-\cos(\pi j)}{j}, \tag{6}$$

where $j = 0$ corresponds to the point where the control is applied. Since only the
relative values are important, the constant $A$ has no special meaning. A computation
with twice the resolution was run to confirm that Eq. (6) gives the proper form of the
weight distribution (see Fig. 7). It turns out that Eq. (6) is the inverse Fourier
transform of $ik_z/|k_z|$, where $k_z$ is the wave number in the spanwise direction, for
a finite maximum wave number, i.e.,

$$\int_{-k_m}^{k_m} \frac{ik_z}{|k_z|}\exp(-ik_z z)\,dk_z = 2\,\frac{1-\cos(k_m z)}{z}, \tag{7}$$

where $k_m = \pi/\Delta z$ is the maximum wave number and $\Delta z$ is the numerical
grid spacing. Replacing $z$ with $j\Delta z$ in the right-hand side of Eq. (7) leads to
Eq. (6). From this result and the convolution theorem, one can suggest the following
simple control law for wall transpiration:

$$\hat{v}_w = C\,\frac{ik_z}{|k_z|}\,\widehat{\left.\frac{\partial w}{\partial y}\right|_w}, \tag{8}$$

where the "hat" denotes a Fourier transformed quantity and $C$ is a positive scale
factor determining the amplitude of the actuation. Equation (8), which should produce
drag reduction similar to that obtained with 7 fixed weights [Eq. (4)], implies that the
optimum blowing and suction at the wall is proportional to
$\partial(\partial w/\partial y)/\partial z$ with the high wave number components
suppressed by $1/|k_z|$. Note that wall-normal actuation proportional to
$\partial(\partial w/\partial y)/\partial z$ counteracts the up-and-down motion induced
by a streamwise vortex (see Fig. 8), consistent with that used by Choi et al. Smith et
al. discussed the stability of the near-wall turbulence structure to a local adverse
pressure gradient imposed near the surface.

FIG. 7. Distribution of the weights: ○, from on-line training; —,
$A(1-\cos(\pi j))/j$. $\Delta z^+ = 6$.

FIG. 8. A schematic of a flow field induced by a streamwise vortex and corresponding
$\partial(\partial w/\partial y|_w)/\partial z$.

FIG. 9. Mean wall-shear stress histories for various control schemes compared to the
no-control case: —, no control; —, on-line control with neural network with 7×1
template; control with 7 fixed weights.


The suction at the left side of the vortex in Fig. 8 can relieve such an adverse
pressure gradient, thus preventing the surface-layer eruption. Choi et al. reported that
actuation based only on the local value of $\partial(\partial w/\partial y)/\partial z$,
which is also a dominant term of the Taylor expansion of $v$ at some distance away from
the wall, did not produce such a substantial drag reduction. This indicates that
suppression of high wave number components by $1/|k_z|$, or equivalently using a locally
weighted value of $\partial w/\partial y$, plays a key role in reducing drag. Note that
the weights at even-numbered grid points away from the center point vanish. This is
beneficial for physical implementation, since sensors and actuators cannot be placed at
the same location. Because the Fourier integral is computed for a finite value of $k_m$,
the values of the weight at noninteger $j$ in Eq. (6) have no meaning. Equation (8) is
equivalent to

$$v_{jk} = C \sum_{i=-(N-1)/2}^{(N-1)/2} W_i \left.\frac{\partial w}{\partial y}\right|_{j,k+i}, \tag{9}$$

where $W_i$ is given by Eq. (6). The magnitude of the weights decays with increasing
distance from the center, which allows for good approximation using only a small number
of weights, i.e., successful control requires only local values of the shear stress. A
natural concern is how different grid spacings affect the control. Since the above
control law [Eq. (9)] is simply another expression of Eq. (8), which is a good
approximation as long as $k_m$ is large enough, the control should be relatively
independent of resolution. This was confirmed by a computation with a higher resolution.
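The relation between the weight pattern of Eq. (6) and the band-limited Fourier integral of Eq. (7) can be checked numerically. The following is a minimal sketch, not the authors' code: the function names, the choice $A = 1$, the unit grid spacing, and the trapezoidal quadrature are all assumptions made for illustration.

```python
import numpy as np


def template_weights(n_points, amplitude=1.0):
    """Return offsets j and weights W_j = A (1 - cos(pi j)) / j, with W_0 = 0."""
    half = (n_points - 1) // 2
    j = np.arange(-half, half + 1)
    w = np.zeros(j.size)
    nonzero = j != 0
    w[nonzero] = amplitude * (1.0 - np.cos(np.pi * j[nonzero])) / j[nonzero]
    return j, w


def band_limited_integral(z, k_m, half_samples=100_000):
    """Trapezoidal estimate of the left-hand side of Eq. (7) for one value of z."""
    # An integer-based grid keeps k exactly symmetric about k = 0.
    k = np.arange(-half_samples, half_samples + 1) * (k_m / half_samples)
    integrand = 1j * np.sign(k) * np.exp(-1j * k * z)
    return np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(k))


if __name__ == "__main__":
    offsets, weights = template_weights(7)
    print(offsets, weights / weights.max())  # weights normalized by the largest one
    dz = 1.0  # assumed grid spacing
    for j in (1, 2, 3):
        lhs = band_limited_integral(j * dz, np.pi / dz)
        rhs = 2.0 * (1.0 - np.cos(np.pi * j)) / (j * dz)
        print(j, lhs.real, rhs)
```

The weights are antisymmetric about the center and vanish at even offsets, which is why a 7-point template uses only 4 nonzero values.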

Control based on Eq. (9) with 7 points produced the same amount of drag reduction (20%)
as the neural network control (see Fig. 9). The constant $C$ is chosen so that the
root-mean-square (rms) value of the actuation is kept at $0.15u_\tau$. Blowing and
suction of this magnitude at the wall suppress the near-wall streamwise vortices by
counteracting the up-and-down motions associated with these vortices (see Fig. 8).
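For readers who want to experiment, the spectral form of the control law [Eq. (8)], with the amplitude fixed by a target rms value as described above, might be sketched as follows. This is an assumption-laden illustration: it uses NumPy's FFT sign convention (which may differ from the authors'), discards the spanwise mean and the unpaired Nyquist mode, and the function name and array layout (spanwise direction along axis 1) are hypothetical.

```python
import numpy as np


def spectral_actuation(dwdy, target_rms):
    """Apply i*k_z/|k_z| in the spanwise direction and rescale to target_rms.

    dwdy       : 2-D array of dw/dy|_w on the wall, shape (n_x, n_z)
    target_rms : desired rms of the actuation (e.g. 0.15 in units of u_tau)
    """
    dwdy = np.asarray(dwdy, dtype=float)
    n_z = dwdy.shape[1]
    k_z = np.fft.fftfreq(n_z) * n_z           # integer spanwise wave numbers
    multiplier = 1j * np.sign(k_z)            # i*k_z/|k_z|, zero for the mean
    if n_z % 2 == 0:
        multiplier[n_z // 2] = 0.0            # drop the unpaired Nyquist mode
    v_hat = multiplier * np.fft.fft(dwdy, axis=1)
    v = np.fft.ifft(v_hat, axis=1).real
    rms = np.sqrt(np.mean(v ** 2))
    scale = target_rms / rms if rms > 0 else 0.0   # plays the role of C > 0
    return scale * v
```

For a single spanwise sine wave in the shear stress, this returns a cosine wave of the same wave number, i.e., the actuation is in phase with the spanwise derivative of the input.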

The fact that a control scheme that is essentially linear [Eq. (9)] produced the same
amount of drag reduction as on-line control with a nonlinear network led to the
following question: Can the same weight pattern be captured when on-line control with a
"linear" network is used? To answer this question, we tested an on-line controller
utilizing a linear network. The hyperbolic tangent function in Eq. (1) was replaced with
a linear function and the hidden layer was removed. All other parameters were kept the
same as in the nonlinear network case. The linear network seemed to produce almost the
same pattern in weights as the one produced by our nonlinear network. Also, drag was
reduced by the same amount. The pattern that was obtained by averaging over time,
however, was not well preserved in time. One of the weights of Eq. (1), for example,
frequently showed order-of-magnitude excursions, resulting in a large standard
deviation, 1.5, compared to the value of the weight, 0.3 (weights were normalized by the
maximum amplitude $W_1$). The nonlinear network, however, produced more constant
weights; the standard deviation of the same weight was only 0.15. This result indicates
that the nonlinearity should add robustness by bounding the weights. This can simplify
hardware implementation by limiting the signals and subsequently the necessary dynamic
range of the processing circuitry.

FIG. 10. Contours of streamwise vorticity in a cross-flow plane: (a) no control; (b)
control using 7 fixed weights. The contour level increment is the same for both figures.
Negative contours are chain-dotted.


VI.  TURBULENCE  STATISTICS 

The computed flow fields for a no-control case and a successful control case, based on
the 7-point weighted sum of $\partial w/\partial y|_w$ [Eq. (9)], were examined to
investigate the mechanism by which the drag reduction is achieved. The most salient
feature of the controlled case was that the strength of the near-wall streamwise
vortices was drastically reduced. In Fig. 10, contours of streamwise vorticity in a
cross-flow plane are shown. This result further substantiates the notion that a
successful suppression of the near-wall streamwise vortices leads to a significant
reduction in drag. Note that for the controlled case the wall actuations were applied at
both walls.

FIG. 11. Probability-density function of the wall-shear stress: —, no control; —,
control with 7 fixed weights. The area under each curve is normalized to one.

The probability-density function of the wall-shear stress in the streamwise direction is
shown in Fig. 11. It is evident that the control is very effective in suppressing large
fluctuations, thus reducing the mean skin friction. Furthermore, the root-mean-square
(rms) values of turbulent fluctuations in the wall region are also reduced, as shown in
Fig. 12(a). The same trend was observed by Choi et al. The rms vorticity fluctuations
are also significantly reduced, except for $\omega_x$ very close to the wall. The
increase for the latter is caused by additional $\partial v/\partial z$ due to the wall
actuation. The rms fluctuation of $\omega_z$ at the wall, which is mainly due to
$\partial u/\partial y$ at the wall, is also decreased. This indicates that the control
scheme led to a reduction in the mean shear and its variance at the wall by suppressing
large fluctuations, as shown in Fig. 11. The reduction in the rms fluctuations in
$\omega_x$ and $\omega_y$ indicates that the control scheme indeed reduced the strength
of the near-wall streamwise vortices and the wall-layer streaks.

FIG. 12. Root-mean-square fluctuations normalized by the wall variables: —, no control;
control with 7 fixed weights. (a) Velocity fluctuations; (b) vorticity fluctuations.

FIG. 13. Contours of the wall actuation: (a) control using $\partial w/\partial y|_w$
with 7 fixed weights; (b) control using information at $y^+ = 10$. Negative contours are
chain-dotted.

Figure 13 compares the distribution of wall actuation used in our control with that from
the $v$-control of Choi et al., which uses the information at $y^+ = 10$, for the same
wall-shear stress distribution. The corresponding wall-shear stress distribution is also
shown in Fig. 13. The two wall actuations show strikingly similar distributions, even
though the wall actuation of our control is based only on the wall-shear stress
$\partial w/\partial y|_w$. Basically, our control scheme detects edges of locally
high-shear stress regions by measuring the spanwise variation of
$\partial w/\partial y$, and applies appropriate wall actuation as shown in Figs. 13(a)
and 13(b). Since high-shear stress regions are usually elongated in the streamwise
direction, only spanwise variation is necessary for detecting the edges. Since
$\partial w/\partial y$ is a direct measurement of streamwise vortices (see Fig. 8), it
provides more appropriate information than $\partial u/\partial y$. This is consistent
with our finding that $\partial w/\partial y$ at several points in the spanwise
direction is enough for good control performance.

VII.  DISCUSSION  AND  CONCLUSION 

We have presented a successful application of a neural network to turbulence control for
drag reduction. First, we were able to construct and train a neural network off-line to
find a correlation between the wall-shear stress and the desired wall actuations coming
from the information at $y^+ = 10$. Based on the optimal network structure from the
off-line training, we successfully implemented an on-line inverse model controller in
numerical experiments of a turbulent channel flow, resulting in about 20% drag
reduction. Finally, we were able to deduce a simple control scheme [Eq. (9)] for drag
reduction based on observation of the weight distribution from a successful control
case. This control scheme uses a minimal amount of wall-shear stress information and
requires only simple operations, thus rendering a scheme whose actual hardware
implementation would be relatively easy.

Beginning with a nonlinear network, we found a simple linear control scheme from the
pattern of the weights, suggesting the use of a linear network. Although a linear
network produced a similar pattern, the pattern was not well preserved in time. The
weights can vary significantly, increasing the difficulty of circuit implementation due
to the larger required dynamic range. The nonlinear network, however, produced bounded
weights, simplifying hardware implementation. This suggests that a certain amount of
nonlinearity is beneficial for capturing a stable coherent pattern for our system.

Although the present control scheme is a significant improvement over earlier approaches
that require velocity information within the flow, a technical issue remains before the
present control scheme can be implemented in practice. Precise control of blowing and
suction distributed over a surface is difficult to implement. Other approaches, such as
surface movements by a deformable surface, may prove to be more practical.

There are also several fundamental issues that must be addressed. First, our numerical
experiments were performed in a very low Reynolds number flow. It remains to be seen
whether the same control scheme extends to higher Reynolds numbers. If the main cause of
high skin friction in turbulent boundary layers at higher Reynolds numbers is also the
near-wall streamwise vortices, the same scheme should work equally well. Detection of
these vortices through the wall-shear stress would become more difficult, however,
because the scales associated with these vortices decrease as the Reynolds number
increases.

Another important issue is the influence of the spatial resolution of sensors and
actuators on performance. We showed that a simple control scheme with 4 neighboring
points in the spanwise direction (3 of the 7 weights are zero), which span 90 wall
units, performed very well. This suggests that the distance between sensors should be
about 20 wall units. As the Reynolds number increases, the physical separation between
sensors must decrease. We note in passing that recent advances in micromachined sensors
and actuators make these scales feasible, and the present work is part of a joint
research project aimed at integrating micron-sized sensors and actuators with analog
VLSI control circuitry.

A third issue is determining an appropriate scale factor $C$ in Eq. (9). We tested a
range of $C$ and empirically found that values yielding an actuation rms value between
0.1 and $0.15u_\tau$ gave the best performance. Smaller values resulted in less
reduction, while larger ones caused rapid fluctuations of the wall-shear stress over
time. The experiments were only performed for $Re_\tau = 100$ and 180, from which the
optimum value was deduced, but we expect that the same amplitude level should produce
similar reductions for higher Reynolds number flows. We further note that for this
amplitude the required power input per unit area to produce the actuation,
$p_w v_w + 0.5\rho v_w^3$ ($\approx 0.1\rho u_\tau^3$), was significantly less than the
power saved due to the reduced drag, $(\Delta C_f/C_f)\,\tau_w U_c$
($\approx 3.5\rho u_\tau^3$) for a 20% reduction in $C_f$. Here $p_w$, $\rho$, $C_f$,
$\tau_w$, and $U_c$ are the wall pressure, density, friction coefficient, averaged
wall-shear stress in the streamwise direction, and centerline velocity, respectively.
Note that this estimate accounts for neither the device losses in delivering blowing and
suction nor the power consumed by sensors.
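The two power figures quoted above can be related with a short calculation. The sketch below assumes $\tau_w = \rho u_\tau^2$ (the definition of the friction velocity) and a centerline velocity of roughly $U_c \approx 17.5\,u_\tau$; the latter value is not stated in the text and is used here only because it reproduces the quoted $3.5\rho u_\tau^3$.

```latex
\begin{align*}
P_{\text{saved}} &= \frac{\Delta C_f}{C_f}\,\tau_w U_c
  \approx 0.2 \times \rho u_\tau^2 \times 17.5\,u_\tau
  = 3.5\,\rho u_\tau^3, \\
\frac{P_{\text{input}}}{P_{\text{saved}}}
  &\approx \frac{0.1\,\rho u_\tau^3}{3.5\,\rho u_\tau^3} \approx 0.03 .
\end{align*}
```

Under these assumptions the ideal actuation power is about 3% of the power saved, before any device or sensor losses are included.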

One final practical issue worth mentioning is the time delay between sensing and
actuation. None was included in any of our numerical experiments. In a real situation,
however, there will be a finite time delay between sensing and actuation. The processing
time for the simple control scheme should be negligible, since a weighted summation is
the only process involved, and thus does not impose any restriction on the time delay.
Ideally, sensors should register the response of the near-wall structures to the wall
actuations, which will take a certain amount of time. Blowing and suction at the wall,
however, can produce an immediate spurious response at nearby sensors due to local
conservation. One possible remedy to this problem is an under-relaxation of the
actuation signal with past signals, accounting for the response on a longer time scale.
For example, an under-relaxation scheme using the following formula

w,—  (10) 

'  ^  j.k  +  i 

could take into account all past signals with a single parameter (between 0 and 1). We
used this approach successfully in our numerical experiment for $Re_\tau = 180$ and
found that there is an optimum value of this parameter that depends on the temporal
resolution.
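Because Eq. (10) is not legible in this copy, the following sketch shows only a standard first-order under-relaxation, which is one common way to blend a new actuation signal with all past signals through a single parameter. The update rule, the function names, and the parameter name `relaxation` are assumptions for illustration and may differ from the authors' formula.

```python
def under_relax(previous_actuation, new_signal, relaxation):
    """Blend the newly computed signal with the previous actuation.

    relaxation must lie strictly between 0 and 1; values near 1 follow the
    new signal closely, values near 0 retain more of the past.
    """
    if not 0.0 < relaxation < 1.0:
        raise ValueError("relaxation must satisfy 0 < relaxation < 1")
    return relaxation * new_signal + (1.0 - relaxation) * previous_actuation


def relaxed_history(signals, relaxation, initial=0.0):
    """Apply under_relax step by step and return the actuation at each step."""
    actuation = initial
    history = []
    for signal in signals:
        actuation = under_relax(actuation, signal, relaxation)
        history.append(actuation)
    return history
```

Unrolling the recursion shows that each past signal enters with a weight that decays geometrically with its age, so a single parameter sets the effective memory of the actuation.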

The most significant finding of the present study is that a single spanwise strip of
$\partial w/\partial y|_w$ is enough to achieve significant drag reduction. Additional
information about $\partial w/\partial y|_w$ in the streamwise direction or
$\partial u/\partial y|_w$ in either direction reduced the efficiency of our
neural-network based control. We also tested different-sized input templates and found
that as the template size increased in the streamwise direction, the training time
became excessive and the fixed pattern in the weights disappeared, indicating that too
many unnecessary weights caused training problems.
Two simple feedback control laws for drag reduction are derived by applying a
suboptimal control theory to a turbulent channel flow. These new feedback control
laws require pressure or shear-stress information only at the wall, and when applied
to a turbulent channel flow at $Re_\tau = 110$, they result in a 16-22% reduction in the
skin-friction drag. More practical control laws requiring only the local distribution of
the wall pressure or one component of the wall shear stress are also derived and are
shown to work equally well.


1.  Introduction 

Recent studies have shown that near-wall streamwise vortices are responsible
for high skin-friction drag in turbulent boundary layers. Many attempts to control
these vortices have been made in order to achieve a skin-friction drag reduction in
turbulent boundary layers. Most such attempts, however, have been ad hoc, largely based
on physical intuition. The active control by Choi, Moin & Kim (1994), for example, used
blowing and suction at the wall that is equal and opposite to the wall-normal component
of the velocity at $y^+ = 10$, resulting in as much as 25% reduction in their numerical
simulation. This approach, however, is impractical since the required velocity
information at $y^+ = 10$ is not normally available. For any practical implementation a
control scheme should be based solely on quantities measurable at the wall.

A more systematic approach based on an optimal control theory was proposed
by Abergel & Temam (1990). Choi et al. (1993) proposed a 'suboptimal' control
procedure, in which the iterations required for a global optimal control were avoided
by seeking an optimal condition over a short time period. The suboptimal control
procedure was successfully applied to control of the Burgers equation. Bewley &
Moin (1994) were the first to apply the suboptimal control procedure to a turbulent
flow and reported about 17% drag reduction. The procedure developed by Bewley
& Moin (1994) still requires velocity information inside the flow in order to solve
the adjoint problem, from which a feedback control input was derived. In spite of
this obvious drawback, however, the fact that a control theory applied to a turbulent
flow resulted in a substantial drag reduction is encouraging, since their control
procedure was derived rigorously from a control theory, in which a pre-determined
cost functional was minimized. Hill (1993, 1994) derived a control input as a function
of the streamwise wall shear only by modeling the near-wall flow with a spanwise
velocity growing linearly, and normal velocity growing quadratically, with normal
distance from the wall. About 15% drag reduction was obtained from this study.
We, however, derive a simple control scheme by minimizing cost functionals that
are related to the streamwise vortices, which have been found to be responsible for
large local drag in turbulent boundary layers. We also tried to minimize drag directly
by having drag itself in the cost functional, but it was not successful (see § 3). The
objective of this paper is to demonstrate that a wise choice of the cost functional
coupled with a variation of the formulation can lead to a more practical control law.

We  present  how  to  choose  a  cost  functional  and  how  to  minimize  it  to  yield  simple 
feedback  control  laws  that  require  quantities  measurable  only  at  the  wall.  One  of 
the  laws  requires  spatial  information  on  the  wall  pressure  over  the  entire  wall  and 
the  other  requires  information,  also  over  the  entire  wall,  on  one  component  of  the 
wall  shear  stress.  We  then  derive  more  practical  control  schemes  that  only  require 
local  wall  pressure  or  local  surface  shear  stress  information,  and  show  that  they  work 
equally  well. 


2.  Suboptimal  procedure 

We  follow  a  similar  procedure  used  by  Choi  et  al.  (1993)  and  Bewley  &  Moin 
(1994).  The  problem  under  consideration  is  a  turbulent  channel  flow,  for  which  the 
governing  equations  are  the  Navier-Stokes  and  continuity  equations  with  the  no-slip 
boundary  condition: 

$$\frac{\partial u_i}{\partial t} + u_j\frac{\partial u_i}{\partial x_j} = -\frac{\partial p}{\partial x_i} + \frac{1}{Re}\frac{\partial^2 u_i}{\partial x_j\,\partial x_j}, \qquad (2.1)$$

$$\frac{\partial u_j}{\partial x_j} = 0, \qquad (2.2)$$

with

$$u_i|_w = \phi(x,z,t)\,\delta_{i2}, \qquad (2.3)$$

where t is the time, $x_1, x_2, x_3$ are the streamwise, wall-normal, and spanwise directions
respectively, $u_i$ are the corresponding velocity components, p is the pressure,
Re is the Reynolds number, and the control input is the wall-normal velocity at the
wall, $\phi$.

All variables are non-dimensionalized by the wall shear velocity, $u_\tau$, and the channel
half-width, $\delta$. We also use x, y, z interchangeably for $x_i$ and u, v, w for $u_i$. Periodic
boundary conditions are imposed in the streamwise and spanwise directions. The flow
rate in the streamwise direction is kept constant, and the drag is measured by the
mean pressure gradient necessary to maintain the constant flow rate.

We  found  that  the  choice  of  the  cost  functional  to  be  minimized  is  critical  in 
the  performance  of  the  control.  Since  the  streamwise  vortices  have  been  known  to 
be  responsible  for  large  drag  in  turbulent  boundary  layers,  we  tried  to  choose  the 
cost  functional  that  is  directly  related  to  them.  This  is  based  on  our  conjecture  that 
a  suitable  manipulation  of  the  streamwise  vortices  would  lead  to  drag  reduction. 
We  carefully  selected  two  cost  functionals  based  on  our  observation  of  a  successful 
control  of  Choi  et  al.  (1994).  As  shown  in  figure  1,  Choi  et  al.'s  (1994)  blowing 
and  suction,  which  are  equal  and  opposite  to  the  wall-normal  velocity  component 
at y+ = 10, effectively suppress a streamwise vortex by counteracting up-and-down
motion induced by the vortex. This blowing and suction creates locally high pressure
in the near-wall region marked with '+' and low pressure in the region marked with
'−' in figure 1. A crucial aspect of the present analysis is the observation that this
blowing and suction increases the pressure gradient in the spanwise direction under
the streamwise vortex near the wall. To demonstrate this behaviour, we examine
computed flow fields. Figure 2 shows a scatter plot between two pressure gradient
fields: the spanwise pressure gradient $\partial p/\partial z$ before the control is applied, and
the pressure gradient at the same location after the control of Choi et al. (1994)
is applied for one time step. It is apparent that the control increased the pressure
gradient significantly. For the uncontrolled case, $Re_\tau$ based on the wall-shear velocity
and the channel half-width is about 110. The spectral code of Kim, Moin & Moser
(1987) is used for all the computations presented here. Simulations are carried out
using 32 × 65 × 32 spectral modes (with dealiasing in the streamwise and spanwise
directions) with the computational domain of $4\pi\delta \times 2\delta \times 4\pi\delta/3$, respectively.

Figure 1. Schematic of a pressure field induced by a control based on y+ = 10.

Figure 2. Correlation between a pressure field of a no-control case and a pressure field modified
by a control based on information at y+ = 10. A line indicating no change of the pressure field is
drawn for guidance.

The  above  argument  suggests  that  we  should  seek  blowing  and  suction  that  increases 
the  pressure  gradient  in  the  spanwise  direction  near  the  wall  for  a  short  time  period 
(i.e.  in  the  suboptimal  sense)  in  order  to  achieve  a  similar  drag  reduction  to  that 
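achieved by Choi et al.'s (1994) control. The cost functional $J(\phi)$ to be minimized is
then

$$J(\phi) = \frac{\ell}{2A\,\Delta t}\int_S\int_t^{t+\Delta t}\phi^2\,dt\,dS - \frac{1}{2A\,\Delta t}\int_S\int_t^{t+\Delta t}\left(\frac{\partial p}{\partial z}\right)_w^2 dt\,dS, \qquad (2.4)$$

where the integrations are over the wall (S) in space and over a short duration in time
$\Delta t$, which typically corresponds to the time step used in the numerical computation,
and $\ell$ is the relative price of the control since the first term on the right-hand side
represents the cost of the actuation $\phi$. Note that there is a minus sign in front of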
the  second  term  since  we  want  to  maximize  the  pressure  gradient.  It  should  be  noted 
that  the  spanwise  pressure  gradient  at  the  wall  will  be  eventually  reduced  when  the 
strength  of  the  near-wall  streamwise  vortices  is  reduced  through  successful  control. 
Here,  blowing  and  suction  that  increase  the  spanwise  pressure  gradient  for  the  next 
step  are  sought  as  a  suboptimal  control. 

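To minimize the cost functional, we first define the differential states of the velocity
and pressure, $(\theta_i, \rho)$, using a Fréchet differential (Finlayson 1972),

$$\theta_i = \frac{\mathcal{D}u_i}{\mathcal{D}\phi}\tilde\phi, \qquad (2.5)$$

$$\rho = \frac{\mathcal{D}p}{\mathcal{D}\phi}\tilde\phi, \qquad (2.6)$$

where

$$\frac{\mathcal{D}f(\phi)}{\mathcal{D}\phi}\tilde\phi = \lim_{\epsilon\to 0}\frac{f(\phi+\epsilon\tilde\phi)-f(\phi)}{\epsilon}, \qquad (2.7)$$

$\tilde\phi$ being an arbitrary perturbation field to $\phi$.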

Next, we choose the Crank-Nicolson scheme for the linear terms and a Runge-Kutta
scheme for the nonlinear terms to yield a discretized form of (2.1) and (2.2):

$$u_i^{n+1} - \frac{\Delta t}{2Re}\frac{\partial^2 u_i^{n+1}}{\partial x_j\,\partial x_j} + \frac{\Delta t}{2}\frac{\partial p^{n+1}}{\partial x_i} = R_i^n, \qquad (2.8)$$

$$\frac{\partial u_j^{n+1}}{\partial x_j} = 0, \qquad (2.9)$$

$$u_i^{n+1}|_w = \phi\,\delta_{i2}, \qquad (2.10)$$

where the superscripts n+1 and n denote the time step, and $R_i^n$ includes the nonlinear
terms and the explicit parts of the pressure gradient and viscous terms. The Fréchet
differential of (2.8)-(2.10) yields the governing equations for the differential states
$(\theta_i, \rho)$,


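$$\theta_i^{n+1} - \frac{\Delta t}{2Re}\frac{\partial^2\theta_i^{n+1}}{\partial x_j\,\partial x_j} + \frac{\Delta t}{2}\frac{\partial\rho^{n+1}}{\partial x_i} = 0, \qquad (2.11)$$

$$\frac{\partial\theta_j^{n+1}}{\partial x_j} = 0, \qquad (2.12)$$

with

$$\theta_i^{n+1}|_w = \tilde\phi\,\delta_{i2}. \qquad (2.13)$$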

Note that $(\mathcal{D}R_i^n/\mathcal{D}\phi)\,\tilde\phi = 0$. Hereinafter, we drop the superscript n+1 and all variables
are understood to be at the (n+1)th time step. Note that there is no contribution
from the nonlinear terms, thus making the equations linear. Generally, the suboptimal
formulation depends on the time advancement scheme used, as shown here (see also
Choi et al. 1993). Dropping the nonlinear terms may miss important flow dynamics.
However, we found from our numerical tests with the full nonlinear terms included
that the contribution from the nonlinear terms is negligible in our boundary control
with short optimization interval $\Delta t$; it turns out that the conservation of mass due to
the wall actuation dominates the near-wall dynamics.

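Under the approximation that $2Re/\Delta t \gg k^2$, where $k = (k_x^2 + k_z^2)^{1/2}$ and $k_x$ and
$k_z$ denote the streamwise and spanwise wavenumbers in the x- and z-directions
respectively†, the above equations have the following solutions in the semi-infinite
domain with periodic conditions in the x- and z-directions (see the Appendix):

$$\hat\theta_1(y) = \frac{ik_x}{k}\,\hat{\tilde\phi}\,\left(\exp[-(2Re/\Delta t)^{1/2}y] - e^{-ky}\right), \qquad (2.14)$$

$$\hat\theta_3(y) = \frac{ik_z}{k}\,\hat{\tilde\phi}\,\left(\exp[-(2Re/\Delta t)^{1/2}y] - e^{-ky}\right), \qquad (2.15)$$

$$\hat\theta_2(y) = \hat{\tilde\phi}\,e^{-ky}, \qquad (2.16)$$

$$\hat\rho(y) = \frac{2}{k\,\Delta t}\,\hat{\tilde\phi}\,e^{-ky}, \qquad (2.17)$$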


where $\hat\theta_i$, $\hat\rho$, and $\hat{\tilde\phi}$ are the Fourier coefficients of $\theta_i$, $\rho$, and $\tilde\phi$ respectively. For the
channel geometry, we originally considered both walls and found that the interaction
between two walls is negligible as long as the typical wavelength (~ $2\pi/k$) associated
with near-wall structures is much smaller than the channel width.

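The Fréchet differential of the cost functional (2.4) becomes

$$\frac{\mathcal{D}J}{\mathcal{D}\phi}\tilde\phi = \frac{\ell}{A\,\Delta t}\int_S\int_t^{t+\Delta t}\phi\,\tilde\phi\,dt\,dS - \frac{1}{A\,\Delta t}\int_S\int_t^{t+\Delta t}\left.\frac{\partial p}{\partial z}\right|_w\left.\frac{\partial\rho}{\partial z}\right|_w dt\,dS. \qquad (2.18)$$

The Fourier representation of the above equation is

$$\frac{\mathcal{D}\hat J}{\mathcal{D}\hat\phi}\,\hat{\tilde\phi}^{*} = \ell\,\hat\phi\,\hat{\tilde\phi}^{*} - \left.\frac{\partial\hat p}{\partial z}\right|_w\left.\frac{\partial\hat\rho}{\partial z}\right|_w^{*}, \qquad (2.19)$$

where the hat denotes the Fourier coefficient, and the superscript * denotes the
complex conjugate. From (2.17), $\partial\hat\rho/\partial z|_w^{*}$ can be expressed in terms of $\hat{\tilde\phi}^{*}$,

$$\left.\frac{\partial\hat\rho}{\partial z}\right|_w^{*} = -\frac{2ik_z}{k\,\Delta t}\,\hat{\tilde\phi}^{*}. \qquad (2.20)$$

Equation (2.19) then reduces to

$$\frac{\mathcal{D}\hat J}{\mathcal{D}\hat\phi}\,\hat{\tilde\phi}^{*} = \ell\,\hat\phi\,\hat{\tilde\phi}^{*} - \frac{2k_z^2}{k\,\Delta t}\,\hat p_w\,\hat{\tilde\phi}^{*}. \qquad (2.21)$$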


† It can be shown that $(2Re/\Delta t)/k^2 \gg 1$. Here we used $u\,\Delta t/\Delta x \sim O(1)$ and
$k_{max} \sim k_\eta \sim Re^{3/4}$, where $k_\eta$ is the wavenumber corresponding to the Kolmogorov length scale.

Figure 3. Correlation between a spanwise wall-shear stress of a no-control case and a spanwise
wall-shear stress modified by a control based on y+ = 10. A line indicating no change of the
spanwise wall-shear stress field is drawn for guidance.


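For an arbitrary $\hat{\tilde\phi}$, equation (2.21) should be satisfied, yielding

$$\frac{\mathcal{D}\hat J}{\mathcal{D}\hat\phi} = \ell\,\hat\phi - \frac{2k_z^2}{k\,\Delta t}\,\hat p_w. \qquad (2.22)$$

From the requirement that the Fréchet differential of the cost functional be minimized,
i.e. $\mathcal{D}\hat J/\mathcal{D}\hat\phi = 0$, the optimum $\hat\phi$ then becomes

$$\hat\phi = C\,\frac{k_z^2}{k}\,\hat p_w, \qquad (2.23)$$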

where C is a positive scale factor that determines the cost of the actuation. Equation
(2.23) indicates that the optimum wall actuation is negatively proportional to the second
spanwise derivative of the wall pressure, with the high-wavenumber components
reduced by 1/k.

Another  wall  quantity  that  indicates  similar  changes  of  the  near-wall  dynamics  due 
to the altered pressure field is the spanwise shear at the wall, $\partial w/\partial y|_w$. Owing to the
added  pressure  gradient  in  the  spanwise  direction  below  the  streamwise  vortex,  the 
spanwise  flow  near  the  wall  is  also  induced,  thus  increasing  the  spanwise  shear  stress 
at  the  wall  (see  figure  1).  The  scatter  plot  between  the  spanwise  shear  stress  at  the 
wall  from  two  different  fields,  one  of  which  is  from  an  unperturbed  channel  and  the 
other  with  the  control  based  on  y+  =  10,  is  shown  in  figure  3.  Thus  another  choice 
for  the  cost  functional  to  be  minimized  is 


$$J(\phi) = \frac{\ell}{2A\,\Delta t}\int_S\int_t^{t+\Delta t}\phi^2\,dt\,dS - \frac{1}{2A\,\Delta t}\int_S\int_t^{t+\Delta t}\left(\frac{\partial w}{\partial y}\right)_w^2 dt\,dS. \qquad (2.24)$$

Figure 4. Correlation between a pressure field of a no-control case and a pressure field modified
by a control based on equation (2.23).

Following the procedure that led to equation (2.23) yields the optimum actuation:

$$\hat\phi = C\,\frac{ik_z}{k}\left.\frac{\partial\hat w}{\partial y}\right|_w. \qquad (2.25)$$


Equation  (2.25)  indicates  that  the  optimum  wall  actuation  should  be  proportional  to 
the  spanwise  derivative  of  the  spanwise  shear  at  the  wall,  with  the  high-wavenumber 
components  reduced  by  1/k.  Note  that  the  scale  factor  C  in  equations  (2.23)  and 
(2.25)  is  arbitrary.  As  mentioned  before,  the  suboptimal  formulation  depends  on  the 
time-advancement  scheme  used.  However,  if  we  used  a  different  implicit  scheme,  only 
the  resulting  constant  C  would  be  different.  This  does  not  cause  a  problem  since  C 
is  chosen  such  that  the  r.m.s.  value  of  the  wall  actuation  is  maintained  at  a  given 
value. 

In the following simulations, we set the r.m.s. value of $\phi$ to be equal to that of the
wall-normal  velocity  at  y+  =  10,  which  gives  the  same  r.m.s.  value  of  wall  actuations 
as  that  of  Choi  et  al.  (1994). 
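
The Fourier-space laws (2.23) and (2.25) can be illustrated with a minimal NumPy sketch on a doubly periodic wall grid. This is not the spectral code of Kim, Moin & Moser (1987) used for the simulations; the function name `suboptimal_actuation`, its arguments and the array layout are assumptions made only for this illustration. The scale factor C is fixed implicitly by rescaling the actuation to a prescribed r.m.s. value, as described above.

```python
import numpy as np


def suboptimal_actuation(wall_field, lx, lz, law="shear", target_rms=1.0):
    """Blowing/suction phi(x, z) from a wall quantity on a periodic grid.

    wall_field : 2-D array indexed [x, z]. Wall pressure for law="pressure"
                 (equation 2.23), spanwise wall shear dw/dy for law="shear"
                 (equation 2.25).
    lx, lz     : streamwise and spanwise periods of the grid.
    target_rms : r.m.s. value imposed on phi (this fixes the constant C).
    """
    wall_field = np.asarray(wall_field, dtype=float)
    nx, nz = wall_field.shape
    # Angular wavenumbers in the ordering used by numpy.fft.
    kx = 2.0 * np.pi * np.fft.fftfreq(nx, d=lx / nx)
    kz = 2.0 * np.pi * np.fft.fftfreq(nz, d=lz / nz)
    KX, KZ = np.meshgrid(kx, kz, indexing="ij")
    k = np.sqrt(KX**2 + KZ**2)
    k_safe = np.where(k == 0.0, 1.0, k)  # the numerators vanish at k = 0

    if law == "pressure":
        kernel = KZ**2 / k_safe          # k_z^2 / k
    elif law == "shear":
        kernel = 1j * KZ / k_safe        # i k_z / k
    else:
        raise ValueError("law must be 'pressure' or 'shear'")

    phi = np.real(np.fft.ifft2(kernel * np.fft.fft2(wall_field)))
    rms = np.sqrt(np.mean(phi**2))
    if rms > 0.0:
        phi *= target_rms / rms
    return phi
```

For a wall shear that varies as cos(kz) with no streamwise variation, this sketch returns an actuation proportional to −sin(kz), i.e. to the spanwise derivative of the shear divided by k, in line with the interpretation of (2.25) given above.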


3.  Results 

The  above  control  laws  (2.23)  and  (2.25)  are  tested  in  a  turbulent  channel  flow. 
The  pressure  gradient  at  the  wall  is  monitored  to  see  if  the  equation  (2.23)  control 
increases  the  pressure  gradient  when  the  control  law  is  applied.  It  behaves  as  expected, 
as  shown  in  figure  4.  The  spanwise  shear  stress  at  the  wall  also  increases  when  the 
equation  (2.25)  control  is  applied  (see  figure  5).  These  results  also  confirm  that  a 
suboptimal procedure without the nonlinear terms does not cause an error for our
boundary controls. Since these control laws specifically increase the wall pressure
gradient or spanwise wall-shear stress, correlations between these variables before
and after control are much higher than those corresponding to the control based on
y+ = 10 information. This suggests that the controls based on equations (2.23) and
(2.25) modify the flow in a different manner from the control based on information
at y+ = 10.

Figure 5. Correlation between a spanwise wall-shear stress of a no-control case and a spanwise
wall-shear stress modified by a control based on equation (2.25).

Time  histories  of  the  mean  streamwise  wall-shear  stress  for  different  control  laws 
are  shown  in  figures  6  and  7.  As  control  begins,  an  immediate  drop  in  the  shear  stress 
is observed for all cases. At the same expense (i.e. the same r.m.s. value of $\phi$), the
control  based  on  (2.25),  which  reduces  drag  by  as  much  as  22%,  is  apparently  more 
effective  than  that  based  on  (2.23)  which  reduces  drag  by  16%.  This  indicates  that 
the  spanwise  wall-shear  stress  is  a  better  quantity  for  control  input. 

The control laws presented above, however, are still impractical to implement, since
they are expressed in terms of the Fourier coefficients (i.e. in wavenumber space),
which require information over the entire spatial domain. Therefore, the inverse
transforms of $k_z^2/k$ and $ik_z/k$ are sought numerically so that the convolution integral
can be used to express the control laws in physical space. The discrete representation
of each control law then becomes†:

$$\phi(x_j, z_k) = C\sum_{j'}\sum_{k'} W^p_{j'k'}\,p_w(x_{j+j'}, z_{k+k'}), \qquad (3.1)$$

† Since $k_z^2/k$ and $ik_z/k$ are not periodic in the wavenumber space, a high resolution was used to
obtain $W^p_{jk}$ and $W^w_{jk}$ to minimize aliasing error.
                              k (= z/Δz)
                   0        1        2        3        4        5        6
j (= x/Δx)   0   1.0000   0.4427   0.0031   0.0440   0.0044   0.0138   0.0032
             1   0.0413   0.0007   0.0040   0.0037   0.0029   0.0021  -0.0016
             2   0.0110   0.0057   0.0019   0.0011   0.0002   0.0000  -0.0003

Table 1. The weight distribution of equation (3.1), $W^p_{jk}$. The weights are symmetric between j and
−j and k and −k. The weights are normalized by $W^p_{00}$. Bold faced weights are used in the calculation
of figure 6. Here, Δx+ = 40 and Δz+ = 13 (Δx = 3Δz).


                              k (= z/Δz)
                   0        1        2        3        4        5        6
j (= x/Δx)   0   0.0000   1.0000   0.1039   0.2679   0.0852   0.1419  -0.0671
             1   0.0086   0.0537   0.0503   0.0310   0.0340   0.0148   0.0237
             2   0.0001  -0.0104   0.0059   0.0051   0.0100   0.0074   0.0092

Table 2. The weight distribution of equation (3.2), $W^w_{jk}$. The weights are symmetric between j and
−j and antisymmetric between k and −k. The weights are normalized by $W^w_{01}$. Bold faced weights
are used in the calculation of figure 7. Here, Δx+ = 40 and Δz+ = 13 (Δx = 3Δz).


and

$$\phi(x_j, z_k) = C\sum_{j'}\sum_{k'} W^w_{j'k'}\left.\frac{\partial w}{\partial y}\right|_w(x_{j+j'}, z_{k+k'}), \qquad (3.2)$$

respectively. The subscripts, j and k, denote the discretizing indices in the x- and
z-directions respectively. The weights, $W^p_{jk}$ and $W^w_{jk}$, are given in tables 1 and 2. Note
that  the  weights  decay  rapidly  with  distance  from  the  point  of  interest,  suggesting  that 
the  optimum  actuation  can  be  obtained  by  a  local  weighted  average  of  the  pressure  or 
spanwise  shear  stress.  The  results  obtained  using  a  37-point  average  for  the  pressure 
and  an  11-point  average  for  the  spanwise  shear  stress  yield  about  the  same  drag 
reduction  as  that  obtained  from  full  integration  using  equations  (2.23)  and  (2.25)  (see 
figures  6  and  7).  The  weights  used  in  these  calculations  are  bold-faced  in  tables  1 
and  2.  It  is  remarkable  that  localized  information  can  produce  such  a  significant  drag 
reduction,  especially  for  the  control  with  the  spanwise  shear  stress. 
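
The local form (3.2) restricted to a single spanwise strip can be sketched as follows. The sketch assumes that the 11-point average uses the j' = 0 row of table 2 for k' = −5, ..., 5 (the bold-face marking is not recoverable from this copy, so this choice rests on the caption of figure 7), together with the stated antisymmetry in k'. The function name `local_shear_actuation`, the periodic wrap-around and the default C = 1 are assumptions for illustration only.

```python
import numpy as np

# Table 2, row j = 0, columns k = 0..5, normalized by the k = 1 weight.
W_ROW0 = np.array([0.0000, 1.0000, 0.1039, 0.2679, 0.0852, 0.1419])


def local_shear_actuation(dwdy_wall, c=1.0):
    """Local weighted sum of the spanwise wall shear along z (last axis).

    dwdy_wall : array whose last axis is the periodic spanwise direction,
                sampled at the spacing for which the weights were computed.
    c         : scale factor C of equation (3.2); arbitrary here.
    """
    f = np.asarray(dwdy_wall, dtype=float)
    phi = np.zeros_like(f)
    for kp in range(1, len(W_ROW0)):
        ahead = np.roll(f, -kp, axis=-1)   # value at z_{k + k'}
        behind = np.roll(f, kp, axis=-1)   # value at z_{k - k'}
        # Antisymmetry W_{0,-k'} = -W_{0,k'} pairs the two neighbours.
        phi += W_ROW0[kp] * (ahead - behind)
    return c * phi
```

Because the weights are antisymmetric, a spanwise-uniform shear produces no actuation, and each output point depends only on five neighbours on either side, which is what makes the scheme attractive for a distributed sensor and actuator array.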

The weight distribution $W^w_{jk}$ for j = 0 is very similar to the one found in the
application of neural networks to the same turbulent flow (Lee et al. 1997), in which
only a single strip of the spanwise shear information was used in the network. The
control law of Lee et al.,

$$\hat\phi = C\,\frac{ik_z}{|k_z|}\left.\frac{\partial\hat w}{\partial y}\right|_w, \qquad (3.3)$$

produces almost the same distribution of blowing and suction as the present
results. Note that blowing and suction from equation (2.25) are nearly the same
as those from equation (3.3) because the near-wall structures have relatively slow
variation in the streamwise direction (the $k_x = 0$ component is dominant). This blowing
and suction distribution is also very similar to the one based on y+ = 10 data
(see Lee et al. 1997).

In  figure  8,  contours  of  the  streamwise  vorticity  in  a  cross-flow  plane  for  the  control 
with equation (2.25) are compared with a no-control case. Significant reduction in
the strength of the streamwise vortices is evident. This again confirms the notion
that a successful manipulation of the near-wall streamwise vortices can lead to drag
reduction. The mean streamwise velocity and root-mean-square velocity near the wall
are shown in figures 9 and 10, respectively, and compared with the no-control case.
The trends are very similar to Choi et al. (1994) and Lee et al. (1997).

Figure 6. Mean streamwise wall-shear stress for various control laws based on the wall pressure,
compared to the no-control case: thick solid line, the control law expressed in Fourier space,
equation (2.23); thin solid line, the control law expressed in physical space, equation (3.1) with the
integration radius of 6Δz. Δt+ ≈ 0.2.

Figure 7. Mean streamwise wall-shear stress for various control laws based on the spanwise
wall-shear stress compared to the no-control case: thick solid line, the control law expressed in
Fourier space, equation (2.25); thin solid line, the control law expressed in physical space, equation
(3.2) with 11 points in the spanwise direction only. Δt+ ≈ 0.2.

Figure 8. Contours of the streamwise vorticity in a cross-flow plane: (a) no control; (b) control
based on wall-shear stress (equation (2.25)). The contour level increment is the same for both figures.
Negative contours are dashed.

Figure 9. The mean streamwise velocity normalized by $u_\tau$: thick solid line, control law (2.25);
dashed line, no control. For the controlled case, the velocity is normalized by the controlled $u_\tau$.

Figure 10. The root-mean-square velocity normalized by $u_\tau$ of the uncontrolled flow: thick solid
line, control law (2.25); dashed line, no control.

Finally, we mention that we tried to find a feedback control scheme minimizing the
following cost functional:

$$J(\phi) = \frac{\ell}{2A\,\Delta t}\int_S\int_t^{t+\Delta t}\phi^2\,dt\,dS + \frac{1}{2A\,\Delta t}\int_S\int_t^{t+\Delta t}\left(\frac{\partial u}{\partial y}\right)_w^m dt\,dS, \qquad (3.4)$$


with  m  =  1  or  2.  This  cost  functional  is  the  most  natural  choice  since  it  contains 
the  quantity  directly  related  to  drag.  We  followed  the  same  procedure  to  derive  the 
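following control schemes:

$$\hat\phi = 0, \qquad (3.5)$$

for m = 1, and

$$\hat\phi = -C\,\frac{ik_x}{k}\left.\frac{\partial\hat u}{\partial y}\right|_w, \qquad (3.6)$$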

for  m  =  2.  Equation  (3.5)  obviously  does  not  reduce  drag  and  equation  (3.6)  increased 
drag  in  our  numerical  simulations.  This  failure  appears  to  be  due  to  the  neglect  of 
the  nonlinear  terms  in  our  formulation,  since  the  numerical  solution  of  the  optimum 
wall  actuations  with  the  full  nonlinear  terms  gave  different  results  (Bewley  &  Moin 
1994).  This  suggests  that  even  in  a  short  time  interval  the  nonlinear  terms  should 
be  included  in  the  suboptimal  formulation  when  drag  itself  is  chosen  as  the  cost 
functional.  Manipulation  of  the  streamwise  vortices  by  wall  actuation,  however,  can 
be  accomplished  through  a  linear  process  as  shown  in  the  previous  section.  It  appears 
that  having  a  term  that  is  directly  related  to  near-wall  streamwise  vortices  in  the  cost 
functional  indirectly  includes  the  nonlinear  effect.  This  is  probably  related  to  the  fact 
that  near-wall  streamwise  vortices  are  self-sustained  by  a  nonlinear  process  (Hamilton, 
Kim  &  Waleffe  1995).  If  the  nonlinear  terms  are  included,  of  course,  it  is  impossible 
to  derive  the  feedback  control  law  in  closed  form  without  approximation.  Hill  (1993), 
on  the  other  hand,  considered  the  nonlinear  term  by  modelling  the  near-wall  flow 
using  the  Taylor  series  expansion  and  presented  a  feedback  control  law  in  closed 
form, which resulted in about 15% drag reduction.


4. Summary

We  have  obtained  two  simple  feedback  control  laws  for  drag  reduction  by  applying 
a  suboptimal  control  procedure  to  a  turbulent  flow.  This  was  possible  since  we  selected 
two  cost  functionals  guided  by  the  successful  control  based  on  quantities  monitored 
at  y+  =  10.  The  present  control  laws  perform  very  well,  resulting  in  substantial 
drag  reductions  when  applied  to  a  turbulent  channel  flow.  More  convenient  control 
schemes  requiring  only  local  information  were  also  derived,  and  were  shown  to  work 
equally  well.  They  require  quantities  measurable  only  at  the  surface  and  thus  should 
be  easier  to  implement  in  practice.  The  present  results  further  substantiate  the  notion 
that  a  successful  manipulation  of  the  near-wall  streamwise  vortices  is  the  key  to 
boundary-layer  control  for  drag  reduction. 

We  are  grateful  to  Dr  Gary  Coleman  and  Mr  Hong  Zhang  for  useful  discussions. 
This  work  was  supported  by  AFOSR  grant  F49620-95-10263  (Program  Managers, 
Drs  James  M.  McMichael  and  Marc  N.  Glauser).  Computer  time  was  provided  by 
the  NAS  Program  of  NASA  Ames  Research  Center,  the  San  Diego  Supercomputer 
Center,  and  the  DoD  High  Performance  Computing  Center.  H.C.  acknowledges  the 
support  by  KOSEF  through  Turbo  and  Power  Machinery  Research  Center. 


Appendix.  Derivation  of  the  solution  (2.14)-(2.17) 


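Equations for the differential states $(\theta_i, \rho)$, (2.11), (2.12), can be rewritten in terms
of the Fourier coefficients,

$$\hat\theta_1 - \frac{\Delta t}{2Re}\left(\frac{d^2}{dy^2} - k^2\right)\hat\theta_1 + \frac{\Delta t}{2}\,ik_x\hat\rho = 0, \qquad (A1)$$

$$\hat\theta_2 - \frac{\Delta t}{2Re}\left(\frac{d^2}{dy^2} - k^2\right)\hat\theta_2 + \frac{\Delta t}{2}\,\frac{d\hat\rho}{dy} = 0, \qquad (A2)$$

$$\hat\theta_3 - \frac{\Delta t}{2Re}\left(\frac{d^2}{dy^2} - k^2\right)\hat\theta_3 + \frac{\Delta t}{2}\,ik_z\hat\rho = 0, \qquad (A3)$$

$$ik_x\hat\theta_1 + ik_z\hat\theta_3 + \frac{d\hat\theta_2}{dy} = 0, \qquad (A4)$$

with

$$\hat\theta_i(0) = \hat{\tilde\phi}\,\delta_{i2}, \qquad \hat\theta_i(\infty) = 0, \qquad (A5)$$

where $\hat\theta_i$ and $\hat\rho$ are defined as follows:

$$\theta_i = \sum_{k_x}\sum_{k_z}\hat\theta_i(y)\,e^{ik_x x}\,e^{ik_z z}, \qquad (A6)$$

$$\rho = \sum_{k_x}\sum_{k_z}\hat\rho(y)\,e^{ik_x x}\,e^{ik_z z}. \qquad (A7)$$

The operation $ik_x\cdot$(A1) $+\ d/dy$(A2) $+\ ik_z\cdot$(A3) together with equation (A4) yields

$$\frac{d^2\hat\rho}{dy^2} - k^2\hat\rho = 0, \qquad (A8)$$

which has a non-growing solution,

$$\hat\rho = \hat\rho_w\,e^{-ky}, \qquad (A9)$$

with a wall value, $\hat\rho_w$, which will be determined later. Using (A9), we can find solutions
to equations (A1), (A2), (A3),

$$\hat\theta_1(y) = \frac{\Delta t}{2}\,ik_x\hat\rho_w\left(\exp[-(k^2 + 2Re/\Delta t)^{1/2}y] - e^{-ky}\right), \qquad (A10)$$

$$\hat\theta_3(y) = \frac{\Delta t}{2}\,ik_z\hat\rho_w\left(\exp[-(k^2 + 2Re/\Delta t)^{1/2}y] - e^{-ky}\right), \qquad (A11)$$

$$\hat\theta_2(y) = \left(\hat{\tilde\phi} - \frac{\Delta t}{2}\,k\hat\rho_w\right)\exp[-(k^2 + 2Re/\Delta t)^{1/2}y] + \frac{\Delta t}{2}\,k\hat\rho_w\,e^{-ky}. \qquad (A12)$$

With these, equation (A4) reduces to

$$\left(\hat{\tilde\phi} - \frac{\Delta t}{2}\,k\hat\rho_w + \frac{\Delta t}{2}\,\frac{k^2}{(k^2 + 2Re/\Delta t)^{1/2}}\,\hat\rho_w\right)\exp[-(k^2 + 2Re/\Delta t)^{1/2}y] = 0. \qquad (A13)$$

Since $2Re/\Delta t \gg k^2$, equation (A13) yields

$$\hat\rho_w = \frac{2}{k\,\Delta t}\,\hat{\tilde\phi}. \qquad (A14)$$

With this, equations (A10), (A11), (A12), (A9) become equations (2.14), (2.15), (2.16),
(2.17), respectively.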
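
The size of the error introduced by the approximation $2Re/\Delta t \gg k^2$ can be checked numerically. The sketch below evaluates the solution of the linear system in the forms (A10)-(A12), computes the continuity residual (A4), and compares the wall value that satisfies continuity without approximation, $\hat\rho_w = \lambda\hat{\tilde\phi}/[(\Delta t/2)\,k\,(\lambda - k)]$ with $\lambda = (k^2 + 2Re/\Delta t)^{1/2}$, against the approximate value (A14). The exact expression is derived here for illustration rather than quoted from the paper, and the function name and the parameter values (Re = 110, Δt = 0.2/110, i.e. Δt+ of about 0.2) are assumptions taken from the simulation description.

```python
import numpy as np


def differential_state(y, kx, kz, re, dt, phi_t=1.0, exact=True):
    """Return (rho_w, theta_2, continuity residual) for one Fourier mode.

    y     : wall-normal positions (array), semi-infinite domain y >= 0.
    phi_t : Fourier coefficient of the perturbation of the wall actuation.
    exact : if True use the wall pressure that satisfies continuity exactly,
            otherwise the approximation rho_w = 2 phi_t / (k dt) of (A14).
    """
    y = np.asarray(y, dtype=float)
    k = np.hypot(kx, kz)
    lam = np.sqrt(k**2 + 2.0 * re / dt)
    half = 0.5 * dt
    if exact:
        rho_w = lam * phi_t / (half * k * (lam - k))
    else:
        rho_w = 2.0 * phi_t / (k * dt)
    fast, slow = np.exp(-lam * y), np.exp(-k * y)
    theta_1 = half * 1j * kx * rho_w * (fast - slow)
    theta_3 = half * 1j * kz * rho_w * (fast - slow)
    a = phi_t - half * k * rho_w
    theta_2 = a * fast + half * k * rho_w * slow
    dtheta_2 = -lam * a * fast - half * k**2 * rho_w * slow
    residual = 1j * kx * theta_1 + 1j * kz * theta_3 + dtheta_2
    return rho_w, theta_2, residual


re, dt = 110.0, 0.2 / 110.0
y = np.linspace(0.0, 0.1, 5)
rho_exact, _, res_exact = differential_state(y, 0.0, 7.0, re, dt, exact=True)
rho_approx, _, res_approx = differential_state(y, 0.0, 7.0, re, dt, exact=False)
ratio = rho_exact / rho_approx  # equals lam / (lam - k), close to one
```

Under these assumed values the exact and approximate wall pressures differ by a few per cent for a spanwise wavenumber of about 7, and the difference grows towards the highest resolved wavenumbers, which indicates where the approximation is least accurate.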
ABSTRACT

We introduce a new approach that alters the local flow
condition using an electrostatically driven microactuator
moving in the in-plane direction such that form drag of
the actuator can be eliminated. This is in contrast to the
electromagnetically driven microflap moving normal to
the substrate. A 60 µm × 200 µm plate moving parallel
to  the  substrate  surface  induces  a  “spanwise  velocity” 
into  the  flow  field.  This  spanwise  velocity,  when 
applied  to  the  near-wall  streaks,  increases  the  transport 
of  high-speed  fluid  away  from  the  wall,  therefore 
causing  reduction  in  viscous  drag.  The  microplate  is 
attached at the end of a microcantilever capable of large
tip deflection (> 100 µm) even off resonance, tested
at operating frequencies of 500-1200 Hz. The
cantilever is a high-aspect-ratio structure (2 µm wide,
6-17 µm thick silicon) to ensure parallel motion over a
long  distance  and  provide  robustness  against  out-of- 
plane  deflection  under  external  disturbances  from  the 
flow.  We  report  the  design  and  one-mask  fabrication  of 
the  in-plane  microactuator  array  made  from  Silicon-On- 
Insulator  (SOI)  wafers  and  experimental  verification  of 
the induced Stokes flow and a local fluid flow.

INTRODUCTION 

Most  of  the  work  done  in  active  control  of  fluid  flow  has 
been  performed  in  macroscale  [1];  it  involved  changing 
the  skin  friction  by  wall  motion,  heating  and  cooling  the 
wall,  and  injecting  mass.  The  key  issues  that  arise  are  the 
spatial  resolution,  operational  frequency  of  the  actuator, 
and  the  power  consumption  of  such  devices.  For  these 
reasons,  MEMS  actuators  have  become  a  very  attractive 
option  for  active  control  of  fluid  flow  [2].  They  provide 
excellent  spatial  resolution  and  high  operational 
frequency  with  low  power  consumption. 

Active  control  of  “streaks”  (high  speed  fluid 
approaching  the  wall  region)  in  microscale  has  been 
shown  feasible  only  recently  [2-5].  The  work  involved 
magnetically  actuated  microflaps  moving  in  the  out-of- 
plane  direction.  This  actuator  produced  an  upward 


velocity  against  streaks,  thereby  reducing  the  viscous 
drag  caused  by  the  high  speed  fluid.  Since  this  actuator 
was  of  millimeter  scale  in  overall  size  and  magnetically 
driven,  the  operational  frequency  was  limited  and  the 
power  consumption  was  relatively  high  (mW). 

The  goal  of  this  paper  is  to  realize  an  in-plane 
microactuator  that  can  be  used  to  generate  a  local  fluid 
field,  which  will  eventually  serve  to  reduce  viscous  drag 
in  fluid  flow.  The  device  is  made  flush  to  the  substrate 
and  operates  in  the  in-plane  direction  to  be  free  from 
“form  drag”  (drag  induced  due  to  a  physical  presence  of 
an  object  obstructing  a  flow),  in  contrast  to  the 
microflaps  [4].  The  reported  in-plane  microactuator  is 
electrostatically  driven  and  is  much  smaller  in  size, 
therefore allowing for lower power consumption (< 1
µW) and higher operating frequency. The in-plane
microactuator  is  approximately  100  times  smaller  than 
the  magnetic  microflap  actuator. 

THEORY  AND  MODELING 

Flow  Theory 

The  formation  of  regions  that  contribute  to  viscous  drag 
is  due  to  high  speed  fluid  streaks  moving  along  the  wall. 
This  creates  a  region  of  high  viscous  drag.  This 
phenomenon  occurs  generally  due  to  a  presence  of  a 
vortex  pair.  As  shown  in  Fig.  1,  high  speed  fluid 
approaches  the  wall  region  due  to  a  counter  rotating 
vortex  pair.  It  is  expected  that  the  fluid  flow  generated 
to  counter  the  downward  motion  will  reduce  the  viscous 
wall  drag. 

The  physical  sizes  of  the  streaks  are  a  function  of 
Reynolds  number.  For  the  case  where  Reynolds  number 
is  10,000  (a  parameter  similar  to  test  condition  of  the 
magnetic  microflap  [3]),  the  streaks  have  an  average 
width of 900 µm and an average length of 30 mm. With
these streak dimensions, three criteria for the in-plane
microactuator are required. The first is that the actuator
end plates be as long as possible in order to interact with
as much of the streak as possible (from their length point of view).


Secondly,  the  actuators  should  produce  large  enough 
plate  displacement  in  order  to  affect  the  streaks  (from 
their  width  point  of  view);  otherwise,  they  would  not 
have  an  effect  on  the  streaks.  Thirdly,  the  induced  flow 
velocity  also  must  be  large  enough  to  interfere  with  the 
down  flow  of  the  incoming  high  speed  fluid.  This 
dictates  that  the  actuators  need  to  operate  at  high 
frequency. 




Fig.  1  Schematic  of  streak  formation 

Previous  modeling  effort  [6]  and  data  from  the  magnetic 
microflap  actuator  [3]  suggest  an  induced  velocity  on  the 
order  of  1-10  cm/s  to  be  sufficient  to  cause  substantial 
reduction  in  viscous  drag  (>  20%).  To  satisfy  these 
three criteria, our actuators are made to move long end
plates (200 µm long) a distance over 100 µm with high
operating frequency (0.5-1.2 kHz).
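
A rough check of the third criterion can be made by estimating the peak speed of the end plate itself. The sketch below assumes sinusoidal motion and interprets the 100 µm travel as the peak-to-peak stroke; both are assumptions, as is the function name. The plate speed is only an upper bound on the velocity induced in the fluid, so the comparison with the 1-10 cm/s target is indicative rather than conclusive.

```python
import math


def peak_plate_velocity(stroke_m, freq_hz):
    """Peak speed (m/s) of a plate in sinusoidal motion.

    stroke_m : peak-to-peak travel of the plate in metres (assumed).
    freq_hz  : drive frequency in hertz.
    """
    amplitude = 0.5 * stroke_m
    return 2.0 * math.pi * freq_hz * amplitude


v_low = peak_plate_velocity(100e-6, 500.0)    # 0.5 kHz
v_high = peak_plate_velocity(100e-6, 1200.0)  # 1.2 kHz
```

Under these assumptions the plate reaches roughly 16 cm/s at 0.5 kHz and 38 cm/s at 1.2 kHz, which is above the 1-10 cm/s range quoted for the required induced velocity.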

Design  and  Dynamic  Modeling  of  the  Actuator 

Our  actuators  are  electrostatically  driven  using  angled 
electrodes  [7].  Embedded  in  these  electrodes  are 
isolation bumpers that are 40 µm × 40 µm in size. The
bumpers serve to prevent shorting between the cantilever
beam and the pulling electrode. Their size allows for a
timed release of the actuators while still ensuring the
isolation bumpers remain anchored to the substrate. The
actuator consists of a large end plate (60 µm wide, 200
µm long) attached at the end of a long cantilever beam
(500-800 µm). The end plates have release holes that
are 6 µm × 6 µm. This size allows for a short timed HF
release step. The supporting cantilever is 2-4 µm wide
and 6-17 µm thick. This gives a high-aspect-ratio device
that  is  rigid  in  the  out-of-plane  direction  and  flexible  in 
the  in-plane  direction.  The  actuators  are  configured  in 
an  array  format  in  pair  formation  as  shown  in  Fig.  2. 
This  formation  allows  for  maximum  induction  of  fluid 
flow.  They  are  also  designed  to  be  recessed  so  that  the 


top  surface  of  the  plates  and  the  electrodes  are  on  the 
same  plane  as  the  substrate  to  prevent  any  interference 
on  the  oncoming  fluid. 


Fig.  2  Design  layout  of  in-plane  microactuators 
(top  view) 

With  the  intention  of  operating  these  devices  at  large 
deflection  under  high  frequency  conditions,  we  need  to 
characterize  their  performance  that  encompasses  the 
resonance modes and frequencies. An understanding of
the  effect  of  the  large  end  plate  on  the  dynamic  response 
of  the  actuators  is  necessary  for  the  successful  operation 
of  the  actuators. 

A finite element model using a commercial package
(ANSYS) is built to determine the response of the
cantilever-plate system. Both a modal analysis and a
harmonic response analysis are performed for the case of
a 4 µm wide, 20 µm thick, and 500 µm long actuator
with an end plate that is 60 µm wide and 200 µm long.
Fig.  3  shows  the  first  resonance  mode  of  the  actuator, 
occurring  at  2.5  kHz.  Up  to  the  actuator’s  fundamental 
frequency,  the  end  plate  follows  the  beam  without  any 
relative  rotation  to  the  cantilever  tip  and  does  not 
attenuate  the  amplitude. 


Fig.  3  Fundamental  mode  of  in-plane  microactuator 

The  second  mode  of  the  device  is  in  the  out-of-plane 
direction,  Fig.  4,  at  6.2  kHz.  The  third  mode  occurs  at 
34.7  kHz.  At  this  mode,  the  rotational  effect  of  the  end 
plate  keeps  the  overall  in-plane  motion  of  the  end  plate 
almost  stationary,  Fig.  5. 


Fig.  4  Second  resonance  mode  of  in-plane 
microactuator  (side  view) 


Fig.  5  Third  resonance  mode  of  in-plane 
microactuator 


FABRICATION 

Fabrication of the in-plane microactuators is done on
Silicon-On-Insulator (SOI) wafers with a single mask
and three processing steps. The SOI wafers are custom-ordered
from a commercial service to our specifications
in order to meet several requirements. The first
requirement is that the wafers be heavily doped on both
the handle wafer (i.e., the main substrate) and the device
wafer (i.e., top layer). This is needed because no
metallization process will be performed. Therefore, in
order to carry a driving signal, the silicon must be
both a structural material and a conductor. The
second requirement is that the sacrificial oxide on the
SOI wafer be thick (~3 µm) to help release the large
end plates and maintain enough gap from the substrate
over the long travel during operation. Electrical
breakdown through the oxide layer under high actuation
voltage is yet another consideration. Finally, the
thickness of the top silicon (device layer) is required to
vary from 6 µm to 18 µm in order to fabricate
high-aspect-ratio structures with narrow beam width (as small
as 2 µm).

The fabrication begins with a Bond and Etch Back
Silicon-On-Insulator (BESOI) wafer, Fig. 6(a). The SOI
wafer is made from two highly doped (100) 4 inch
wafers (0.005 Ω·cm). Wet oxide of 1.5 µm is grown on
each wafer. The wafers are then bonded, oxide to oxide,
at room temperature. They are then placed in an 800 °C
furnace to strengthen the bond. On completion of the
annealing, one side of the bonded wafer undergoes
grinding to the target device thickness. Once the target
thickness is reached, the wafers are polished to give a
finished surface.

After photoresist is patterned to define the device, the
wafers are sent to MCNC [8] for a deep reactive-ion etch
(DRIE), which is performed through the device layer.
Oxide is an effective etch stop layer for the DRIE, but
end point detection was also used to accommodate
the varying thickness of our device layers, Fig. 6(b).
The photoresist is then removed in-house and the wafers
are diced into 1 cm × 1 cm chips, which are used for actual
flow experiments. Finally, the test chips undergo a
release process, Fig. 6(c). The release process is a timed
etch of the sacrificial oxide. The oxide underneath the
actuators is removed in 8 minutes of 49% HF. All the
other structures have significantly larger anchor areas
and therefore remain anchored after the end
plates have been released.

Releasing/drying was a challenge, despite the thickness
of the beam (over 10 µm), due to the length (up to 1000
µm overall) and the existence of large plates at the end.
The standard CO2 supercritical drying [9] and surface
passivation by self-assembled monolayer (SAM) coating
[10], both proven effective for simple beams [11],
produced poor results (< 10% yield) for the in-plane
microactuators. For the CO2 drying, it is speculated that
the difficulty stemmed from replacing the methanol
under the large plates with the CO2 liquid through the
narrow and deep trenches made by DRIE. Similar
problems also make the SAM coating method
ineffective.

Fig. 6 Flow process (a) SOI wafer, (b) DRIE on
device layer, (c) releasing structure by timed etch

A dramatic improvement was found when the chip
was loaded “upside down” on completion of the
methanol soak, apparently helping the methanol drain by
gravity through the lighter CO2 liquid. Rinsing times are also
increased to allow more time for diffusion. The
release process consists of 8 minutes in 49% HF followed by
a 30 minute DI water rinse in three successive baths (10
minutes each). The chips are then transferred to
methanol and kept for another 30 minutes before
CO2 drying. Twenty CO2 rinses are done with a dwell
time of 3 minutes between rinses. This release
method gave a yield close to 100%.
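The release sequence described above can be written down as a small recipe table to check the minimum bench time. The sketch below is only a bookkeeping aid under stated assumptions: the `Step` class and the step names are hypothetical, the CO2 rinse itself is given zero duration because the text states only the 3 minute dwell, and that dwell is assumed to apply to the 19 gaps between the 20 rinses.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    """One step of the release recipe (names are illustrative, not from the paper)."""
    name: str
    minutes: float                   # duration of one repetition
    repeats: int = 1
    dwell_between_min: float = 0.0   # waiting time between repetitions

    def total_minutes(self) -> float:
        gaps = max(self.repeats - 1, 0)
        return self.repeats * self.minutes + gaps * self.dwell_between_min


RELEASE_RECIPE = [
    Step("49% HF oxide etch", minutes=8),
    Step("DI water rinse bath", minutes=10, repeats=3),
    Step("methanol soak", minutes=30),
    # The rinse duration is not stated in the text, so only the dwell is counted.
    Step("CO2 rinse", minutes=0, repeats=20, dwell_between_min=3),
]


def recipe_minutes(recipe) -> float:
    """Sum the time of all steps, including dwell between repetitions."""
    return sum(step.total_minutes() for step in recipe)


if __name__ == "__main__":
    for step in RELEASE_RECIPE:
        print(f"{step.name:22s} {step.total_minutes():6.1f} min")
    print(f"{'minimum total':22s} {recipe_minutes(RELEASE_RECIPE):6.1f} min")
```

Under these assumptions the sequence takes at least a little over two hours per batch, most of it in rinsing and solvent exchange rather than in the HF etch itself.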



Fig.  7  SEM  pictures  of  the  fabricated  in-plane 
microactuators  (a)  a  pair  of  microactuators 
(b)  close-up  view  near  the  cantilever  end  and  the  plate 

Fig. 7(a) shows a pair of released actuators with the
angled electrode configuration used to drive them. As
shown, the actuators are designed to be recessed to
prevent unwanted effects on the oncoming fluid flow.
In addition, this design shields the
actuators against any perturbation in the flow that might
cause the devices to break off. Fig. 7(b) is a close-up
showing the high-aspect-ratio beam, the end plate with
its release holes, the angled drive electrodes, and the
electrostatic isolation bumpers.

TESTING 

Two flow experiments are performed. The first
verifies that the actuator is capable of generating a local
flow; the flow sought is Stokes flow.
The second verifies that the actuators do
indeed generate a net upward flow that can be used to
counter the flow being brought in by a vortex pair. The
second experiment uses a flow visualization technique.

To verify Stokes flow, a hot-wire anemometer is placed
approximately 400 µm above the end plate of the
microactuator, Fig. 8.
The hot-wire is turned on and a signal is recorded before
activating the actuator. The actuator is then turned on
and made to displace the end plate 100 µm
peak-to-peak at a frequency of 500 Hz. The signal from
the hot-wire is recorded, Fig. 9. The signal picked up by
the hot-wire shows the induced Stokes flow following
the frequency of the actuators.

Fig. 8 Experimental set-up to verify induction of
Stokes flow

Fig. 9 Stokes flow detection due to actuator motion
picked up by hot-wire

Fig. 10 Experimental set-up used to visualize induced
flow by actuator array

In the second experiment, smoke is introduced onto an
array of actuators. A laser sheet is placed on top of the
actuators, Fig. 10. The entire experiment is enclosed in a
controlled environment so that the surrounding fluid is stagnant.
Once the smoke is introduced onto the actuators, it can
be seen as a white feature, Fig. 11(a). When the
actuators are turned on, the smoke is seen to displace
upwards and diffuse away. This observation was clear
during direct viewing. The flow visualization technique
thus gave visual evidence of the induced flow.


Fig.  11  Flow  visualization  of  induced  flow  by  actuator 
array 


SUMMARY 

High-aspect-ratio, in-plane microactuators have been
designed, fabricated, and tested for fluid control. The
goal is for the microactuator to induce a fluid flow
that ultimately counters the high speed
fluid generated by a vortex pair moving towards a wall
region, thus reducing viscous drag. The reported
in-plane microactuators are shown to perform as designed
and to generate a local fluid flow. The induced upward
flow was verified by the use of a hot-wire and
laser-sheet flow visualization.

ABSTRACT


The successful design, modeling, fabrication, and testing of high-aspect-ratio, large-deflection
in-plane microactuators are presented. The large-displacement, in-plane actuator has a unique
application in the area of fluid flow control. Unlike the previously used electromagnetically actuated
microflap, whose motion was normal to the substrate, we introduce a novel design that alters the
local fluid flow by moving the actuators parallel to the substrate. This new approach of “in-plane
motion” allows for a condition free of “form drag.” Furthermore, electrostatic drive allows for
lower power consumption (µW). The actuator includes microplates that are 60 µm × 200 µm.
These microplates, when moved parallel to the substrate surface, induce a “spanwise velocity”
in the flow field above them. This induced velocity field, when applied to the near-wall streaks,
would increase the transport of high-speed fluid away from the wall, thereby reducing
viscous drag. The actuators are made from silicon-on-insulator (SOI) wafers using a one-mask
deep reactive-ion-etching (DRIE) process. The microplates are suspended by a high-aspect-ratio
cantilever structure (2 µm wide, 6-18 µm thick silicon) to ensure robustness against any
disturbances in the out-of-plane direction. We present experimental verification of both the induced
Stokes flow and a local fluid flow created by our in-plane microactuators.

Key words: Silicon-on-insulator, high aspect ratio, deep reactive ion etching, viscous drag
reduction, in-plane actuator, large deflection

INTRODUCTION


Conventional techniques for active control of fluid flow consist of changing the skin friction by wall motion, heating
and cooling the wall, and injecting mass. These techniques are all being implemented at the macroscale [1]. However, they
suffer from severe limitations in spatial resolution, operational frequency of the actuator, and power
consumption. MEMS actuators therefore offer a very attractive solution for active control of
fluid flow [2]. The inherent nature of MEMS devices offers excellent spatial resolution and high operational frequency
with low power consumption.

Recently published work on magnetically actuated microflaps moving in the out-of-plane direction has shown the
feasibility of active control of “streaks” (high speed fluid approaching the wall region) at the microscale [2-5]. The control
was done by displacing a microflap in the out-of-plane direction so as to produce an upward velocity against the
streaks. This induced velocity field counteracts a streak, thereby reducing the viscous drag caused by the high speed
fluid. The microflaps used were of millimeter scale and magnetically driven. The operational frequency was
thus limited, and the power consumption was relatively high (mW).

The goal of this paper is to realize the induction of a local fluid flow caused by in-plane actuation of an array of
electrostatically driven microdevices. This induced local fluid flow could eventually serve to reduce viscous drag. The
devices are made flush with the substrate and operate in the in-plane direction to be free from “form drag” (drag induced
by the physical presence of an object obstructing a flow). This is in contrast to the microflaps [4]. The reported
in-plane microactuators are electrostatically driven and are approximately 10 times smaller than the magnetic microflap
actuator, allowing for low power consumption (< 1 µW) and high operating frequency (500-1000 Hz).

THEORY AND MODELING


Flow  Theory 

Generally,  the  presence  of  a  counter-rotating  vortex  pair  gives  rise  to  a  region  of  high  viscous  drag,  Fig.  1.  These 
occurrences  are  known  as  “streaks”.  Streaks  bring  in  high  speed  fluid  downward  to  the  wall  region  increasing  drag.  It 
is  expected  that  the  viscous  wall  drag  will  be  reduced  if  an  upward  flow  is  induced  to  counteract  this  downward  motion 
of  the  high  speed  flow. 

The physical sizes of the streaks are a function of Reynolds number. For this paper we choose a Reynolds number of
10,000 (similar to the test condition of the magnetic microflap [3]). At this Reynolds number the
streaks have an average width of 900 µm and an average length of 30 mm. These feature sizes
dictate the design criteria of the microdevice and lead to three requirements for the in-plane
microactuator. First, the actuator end plates should be as long as possible in order to interact with as
much of the streak length as possible. Second, the actuators should produce a plate
displacement large enough, relative to the streak width, to affect the streaks.
Third, the induced flow velocity must be large enough to interfere with the downward flow of the incoming
high speed fluid. The last condition dictates that the actuators operate at relatively high frequency (~1 kHz).

Previous modeling [6] and data from the magnetic microflap actuator [3]
suggest that an induced velocity on the order of 1-10 cm/s is sufficient to cause a substantial reduction in viscous drag (>
20%). To satisfy the criteria imposed by the flow conditions, our actuators are made to move long microplates (200 µm
long) a distance of over 100 µm at a high operating frequency (0.5-1.2 kHz).
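As a rough consistency check on these numbers, the peak speed of the plate itself follows from simple kinematics if the motion is assumed to be sinusoidal. The sketch below is illustrative only: the function name is hypothetical and the sinusoidal waveform is an assumption, not something stated in the paper. The plate speed is also only the wall velocity; the velocity induced in the fluid above the plate will be smaller.

```python
import math


def peak_plate_velocity(peak_to_peak_m: float, frequency_hz: float) -> float:
    """Peak speed of a plate moving as x(t) = (d_pp / 2) * sin(2*pi*f*t)."""
    amplitude_m = peak_to_peak_m / 2.0
    return 2.0 * math.pi * frequency_hz * amplitude_m


if __name__ == "__main__":
    stroke_m = 100e-6  # 100 um peak-to-peak, as quoted in the text
    for f_hz in (500.0, 1000.0, 1200.0):
        v = peak_plate_velocity(stroke_m, f_hz)
        print(f"{f_hz:6.0f} Hz -> peak plate speed {v * 100:.1f} cm/s")
```

For a 100 µm stroke at 500 Hz the peak plate speed comes out near 16 cm/s, which sits above the 1-10 cm/s target range for the induced velocity and leaves some margin for the decay of the motion away from the wall.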
Design of the Actuator


The actuators consist of large microplates (60 µm wide, 200 µm long) attached at the end of long cantilever beams
(500-800 µm), see Fig. 2. The cantilever beams are electrostatically driven using angled electrodes [7,8]. Isolation
bumpers (40 µm × 40 µm) are embedded within these electrodes to prevent shorting between the cantilever beams and
driving electrodes. The bumpers are designed to be large enough to allow for a timed release of the actuators
while remaining well anchored to the substrate. The length of the supporting cantilever beams ranges from 500 to 800
µm, and the width and thickness range from 2-4 µm and 6-18 µm, respectively. These dimensions constitute a
high-aspect-ratio device that is rigid in the out-of-plane direction and yet flexible in the in-plane direction. The microplates
contain 6 µm × 6 µm etch holes to control the HF etching time for the release process. The actuators are configured in
an array format in pair formation as shown in Fig. 2. This pair formation allows for maximum induction of fluid flow.
They are also designed to sit in a recessed area so that the top surface of the plates and the electrodes are on the same
plane as the substrate surface (wall) to prevent any interference with the oncoming fluid.
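The claim that a narrow, thick beam is flexible in-plane and rigid out-of-plane can be illustrated with the textbook tip stiffness of an end-loaded cantilever, k = 3EI/L³. The sketch below is a minimal illustration, assuming a uniform rectangular beam, small deflections, and the 180 GPa Young's modulus used in the paper's modeling; the function name and the choice of the 2 µm × 18 µm × 500 µm corner of the design range are for illustration only, and the end plate is ignored.

```python
E_SILICON = 180e9  # Pa, Young's modulus used in the paper's modeling


def tip_stiffness(bend_dim_m: float, other_dim_m: float, length_m: float,
                  youngs_modulus: float = E_SILICON) -> float:
    """Tip stiffness k = 3*E*I/L^3 of an end-loaded cantilever.

    bend_dim_m is the cross-section dimension along the bending direction,
    so the second moment of area is I = other_dim_m * bend_dim_m**3 / 12.
    """
    second_moment = other_dim_m * bend_dim_m ** 3 / 12.0
    return 3.0 * youngs_modulus * second_moment / length_m ** 3


# One corner of the quoted design range (metres).
width, thickness, length = 2e-6, 18e-6, 500e-6
k_in_plane = tip_stiffness(width, thickness, length)
k_out_of_plane = tip_stiffness(thickness, width, length)

if __name__ == "__main__":
    print(f"in-plane     k = {k_in_plane:.4f} N/m")
    print(f"out-of-plane k = {k_out_of_plane:.3f} N/m")
    print(f"ratio = {k_out_of_plane / k_in_plane:.1f}")
```

Because only the orientation of the cross-section changes, the stiffness ratio reduces to (thickness/width)², which is 81 for this corner of the design range.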

Modeling  of  the  Actuator 

Electrostatic  performance,  resonance  modes,  and  frequencies  require  rigorous  analysis  due  to  the  fact  that  our  actuators 
are  operated  under  large  deflection  and  high  frequency  conditions.  An  understanding  of  the  effect  of  the  large  end  plate 
on  the  dynamic  response  of  the  actuators  is  also  necessary  for  the  successful  operation  of  the  actuators. 

Electrostatic modeling was performed by coupling a commercial finite element package (ANSYS) with a Fortran
program. The material properties used for the modeling are: Young’s modulus of 180 GPa, density of 2330 kg/m³, and
Poisson’s ratio of 0.27. Our model was compared with both previous experimental data [7] and data
obtained from our in-plane actuators. Fig. 3 shows how well the model predicts the data.

A finite element model in ANSYS is used to determine the dynamic response of the cantilever-plate system. Both a
modal analysis and a harmonic response analysis are performed for the case of a 4 µm wide, 18 µm thick, and 500 µm
long actuator with an end plate that is 60 µm wide and 200 µm long. Fig. 4(a) shows the first resonance mode of the
actuator, occurring at 2.5 kHz. Up to the actuator’s fundamental frequency, the end plate follows the beam without any
rotation relative to the cantilever tip and does not attenuate the amplitude. The second mode of the device is in the
out-of-plane direction, Fig. 4(b), at 6.2 kHz. The third mode occurs at 34.7 kHz. At this mode, the rotational effect of the
end plate keeps the overall in-plane motion of the end plate almost stationary, Fig. 4(c).
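A hand estimate can serve as an order-of-magnitude check on the first in-plane mode. The sketch below is not the authors' finite element model: it is a lumped spring-mass approximation that treats the end plate as a solid point mass at the beam tip, adds the standard 33/140 Rayleigh fraction of the beam mass, and uses the material properties quoted above. The function name is hypothetical.

```python
import math

E_SILICON = 180e9      # Pa, from the paper's modeling section
RHO_SILICON = 2330.0   # kg/m^3, from the paper's modeling section


def lumped_in_plane_frequency(beam_w, beam_t, beam_l, plate_w, plate_l):
    """First in-plane resonance (Hz) of a cantilever with a tip mass.

    Assumptions: Euler-Bernoulli beam, plate treated as a solid point mass at
    the tip (no release holes, no rotary inertia), same thickness as the beam.
    All dimensions are in metres.
    """
    second_moment = beam_t * beam_w ** 3 / 12.0           # bending about the thickness axis
    stiffness = 3.0 * E_SILICON * second_moment / beam_l ** 3
    plate_mass = RHO_SILICON * plate_w * plate_l * beam_t
    beam_mass = RHO_SILICON * beam_w * beam_t * beam_l
    effective_mass = plate_mass + (33.0 / 140.0) * beam_mass
    return math.sqrt(stiffness / effective_mass) / (2.0 * math.pi)


if __name__ == "__main__":
    f_hz = lumped_in_plane_frequency(4e-6, 18e-6, 500e-6, 60e-6, 200e-6)
    print(f"lumped estimate: {f_hz / 1e3:.2f} kHz")
```

This estimate comes out at a few kilohertz (roughly 4.5 kHz), the same order as the 2.5 kHz finite element result. The lumped model ignores the finite extent of the plate, its rotary inertia and the release holes, so it should be read only as a sanity check and not as a substitute for the modal analysis.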


FABRICATION 


Fabrication of the in-plane microactuators is done on commercially available, custom-ordered Silicon-On-Insulator
(SOI) wafers. The wafers had to meet three constraints: a certain level of conductivity, the thickness of the
sacrificial oxide, and the thickness of the top silicon. First, high conductivity is required so that both a
driving signal on the top silicon and a ground potential on the bottom silicon can be carried without metallization. The
wafers therefore needed to be heavily doped on both the handle wafer (i.e., the main substrate) and the device wafer (i.e., top
layer). Second, the sacrificial oxide on the SOI wafer needs to be thick enough that the cantilever and end
plates are released, the safety bumpers stay anchored, and the gap is large enough to ensure long travel over the substrate during
operation. Electrical breakdown through the oxide layer under high actuation voltage is yet another consideration. We
used 3 µm thick oxide. Third, we varied the thickness of the top silicon (device layer) from 6 µm to 18 µm in order to
fabricate high-aspect-ratio structures with narrow beam width (as small as 2 µm).

The fabrication consists of a single mask and three processing steps. We begin with a Bond and Etch Back
Silicon-On-Insulator (BESOI) wafer, Fig. 5(a). The SOI wafer is made from two highly doped (100) 4 inch wafers (0.005
Ω·cm). Wet oxide of 1.5 µm is grown on each wafer, and the wafers are then bonded, oxide to oxide, at room
temperature. They are then placed in an 800 °C furnace to strengthen the bond. On completion of the annealing, one
side of the bonded wafer undergoes grinding to the target device thickness. Once the target thickness is reached, the
wafers are polished to give a finished surface.

After photoresist is patterned to define the device, the wafers are sent to MCNC [9] for a deep reactive-ion etch (DRIE),
which is performed through the device layer to the sacrificial oxide, Fig. 5(b). To account for non-uniformity across the
wafer, a 15% over-etch was used in combination with an end-point detection system. The photoresist is then stripped
in-house and the wafers are diced into 1 cm × 1 cm dies, which are used for actual flow experiments. Finally, the test
chips undergo a release process, Fig. 5(c). The release process is a timed etch of the sacrificial oxide. The oxide
underneath the actuators is removed in 8 minutes of 49% HF. All the other structures have significantly larger anchor
areas and therefore remain anchored after the cantilevers and end plates have been released.
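The timed release works because the HF undercuts every feature from its edges, so narrow features are freed long before wide ones. The sketch below turns the quoted dimensions into bounds on the lateral undercut rate. It assumes a uniform, isotropic lateral etch that attacks each feature from both sides; the function and constant names are hypothetical, and the end plates are left out because the pitch of their release holes is not given in the text.

```python
def undercut_to_release(feature_width_um: float) -> float:
    """Lateral undercut (um) needed to free a strip etched from both edges."""
    return feature_width_um / 2.0


def anchor_width_left(feature_width_um: float, undercut_um: float) -> float:
    """Width (um) of oxide still holding a feature after a given undercut."""
    return max(0.0, feature_width_um - 2.0 * undercut_um)


ETCH_MINUTES = 8.0     # timed etch in 49% HF, from the text
WIDEST_BEAM_UM = 4.0   # upper end of the 2-4 um beam width range
BUMPER_UM = 40.0       # isolation bumper side length

# The beam must be fully undercut; the bumper must not be.
min_rate = undercut_to_release(WIDEST_BEAM_UM) / ETCH_MINUTES
max_rate = undercut_to_release(BUMPER_UM) / ETCH_MINUTES

if __name__ == "__main__":
    print(f"lateral etch rate must exceed {min_rate:.2f} um/min to free the beams")
    print(f"and stay below {max_rate:.2f} um/min to keep the bumpers anchored")
    print(f"bumper oxide left after a 10 um undercut: {anchor_width_left(BUMPER_UM, 10.0):.0f} um")
```

Under these assumptions the 8 minute etch has a wide window, between 0.25 and 2.5 µm/min of lateral undercut, in which the beams are released while the 40 µm bumpers keep part of their oxide anchor.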

Releasing/drying was a challenge, despite the thickness of the beam (over 10 µm), due to the length (up to 1000 µm
overall) and the existence of large plates at the end. The standard CO2 supercritical drying [10] and surface passivation
by self-assembled monolayer (SAM) coating [11,12], both proven effective for simple beams [13], produced poor
results (< 10% yield) for the in-plane microactuators. For the CO2 drying, it is speculated that the difficulty stemmed
from replacing the methanol under the large plates with the CO2 liquid through the narrow and deep trenches made by
DRIE. Similar problems also make the SAM coating method ineffective.

A dramatic improvement was found for the CO2 drying when the chip was loaded “upside down” on completion of
the methanol soak, apparently helping the methanol drain by gravity through the lighter CO2 liquid. Rinsing times are also
increased to allow more time for diffusion. The release process consists of 8 minutes in 49% HF followed by a 30 minute
deionized (DI) water rinse in three successive baths (10 minutes each). The chips are then transferred to methanol and
kept for another 30 minutes before CO2 drying. Twenty CO2 rinses are done with a dwell time of 3 minutes
between rinses. This release method gave a yield close to 100%.

Fig. 6(a) shows a pair of recessed, released actuators with the angled electrode configuration used to drive them. The
recess prevents any unwanted effects on the oncoming fluid flow. In addition, this design
shields the actuators against any perturbation in the flow that might cause the devices to break off. Fig. 6(b) is a
close-up showing the high-aspect-ratio beam, the end plate with its release holes, the angled drive electrodes, and the electrostatic
isolation bumpers.


TESTING 


The two goals of this work were to verify Stokes flow and a net induced local flow in the direction opposite to the flow
being brought in by a vortex pair. Verification of Stokes flow was done with a standard micro hot-wire, while that of the
induced flow was done by a flow visualization technique.

To verify Stokes flow, a hot-wire anemometer is placed 400 µm above the microactuator’s end plate, Fig. 7. The
hot-wire is turned on and a signal is recorded before activating the actuator. Then the actuator is turned on and made to
displace the end plate 100 µm peak-to-peak at a frequency of 500 Hz. While the signal from the hot-wire is being
recorded, the actuator is turned off again, Fig. 8. Fig. 8 clearly indicates that the signal obtained from the hot-wire is that
of the induced Stokes flow following the frequency of the actuator.
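To get a feel for what the hot-wire should see at this height, one can use the classical solution for an infinite flat plate oscillating in its own plane (Stokes' second problem), in which the velocity amplitude decays as exp(-y/δ) with δ = sqrt(2ν/ω). The sketch below is a rough idealization: the real plate is only 60 µm × 200 µm, and the kinematic viscosity of room-temperature air (about 1.5e-5 m²/s) is an assumed value, since the paper does not state the fluid properties. The function names are hypothetical.

```python
import math

NU_AIR = 1.5e-5  # m^2/s, assumed kinematic viscosity of room-temperature air


def stokes_layer_thickness(kinematic_viscosity: float, frequency_hz: float) -> float:
    """Stokes layer thickness delta = sqrt(2*nu/omega) in metres."""
    omega = 2.0 * math.pi * frequency_hz
    return math.sqrt(2.0 * kinematic_viscosity / omega)


def amplitude_ratio(height_m: float, kinematic_viscosity: float,
                    frequency_hz: float) -> float:
    """Velocity amplitude at height y relative to the plate: exp(-y/delta)."""
    delta = stokes_layer_thickness(kinematic_viscosity, frequency_hz)
    return math.exp(-height_m / delta)


if __name__ == "__main__":
    f_hz, probe_height_m = 500.0, 400e-6   # values quoted in the text
    delta = stokes_layer_thickness(NU_AIR, f_hz)
    ratio = amplitude_ratio(probe_height_m, NU_AIR, f_hz)
    print(f"Stokes layer thickness: {delta * 1e6:.0f} um")
    print(f"amplitude at probe / plate amplitude: {ratio:.3f}")
```

With these assumptions the layer thickness at 500 Hz is roughly 100 µm, so a probe 400 µm above the plate sits about four layer thicknesses away, where the ideal oscillation amplitude is on the order of a percent or two of the plate speed.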

In the second experiment, a smoke generator introduces smoke onto a pair of actuators. A laser sheet shining through
the smoke is placed on top of the actuators, Fig. 9. The entire experiment is enclosed in a controlled environment
so that the surrounding fluid is stagnant. Once the smoke is introduced onto the actuators, it can be seen as a white feature, Fig. 10(a).
Fig. 10(b) shows a zoomed-in view of the smoke on top of the end plates. A reflection on the silicon is also seen.
When the actuators are turned on, the smoke is clearly seen to be displaced upward, verifying a net
induced flow, Fig. 10(c). The upward flow is indicated by the clear separation that exists between the smoke and the die.

SUMMARY


Successful modeling, fabrication, and flow experiments have been presented in this paper. Our electrostatic and modal
models accurately predict the performance of our microactuators. Using SOI wafers, we also demonstrated the
one-mask process required to fabricate the microdevices. In addition, the realization of both Stokes flow and a net induced
flow has been presented. These induced flows were verified by the use of a hot-wire and a flow visualization technique,
respectively. It is the successful design and fabrication of high-aspect-ratio, in-plane microactuators that made
the flow experiments possible. The unique MEMS design enabled an induced flow large enough to be detected with
conventional fluid mechanics tools (hot-wire and flow visualization). The next step is to have the microactuator induce a
fluid flow that ultimately acts to counter the high speed fluid generated by an artificial vortex pair
moving along a wall region, thus verifying reduction in viscous drag.

FIGURE CAPTIONS


Fig. 1 Formation and counteraction of streaks
Fig. 2 Design layout of in-plane microactuators
Fig. 3 Pull-in voltage prediction
Fig. 4 Modes of in-plane microactuator
(a) Fundamental mode (top view), (b) Second resonance mode, (c) Third resonance mode (top view)
Fig. 5 Flow process (a) SOI wafer, (b) DRIE on device layer, (c) releasing structure by timed etch
Fig. 6 SEM pictures of the fabricated in-plane microactuators
(a) a pair of microactuators, (b) close-up view near the cantilever end and the plate
Fig. 7 Experimental set-up to verify induction of Stokes flow
Fig. 8 Detection of Stokes flow (500 Hz)
Fig. 9 Experimental set-up to detect induced flow
Fig. 10 Flow visualization of induced flow by actuator pair
(a) Smoke on full die, (b) Zoomed view of smoke on top of actuator pair (actuator is off), (c) Upward
displacement of smoke when actuator is turned on